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INTERLEUKIN-8 HOMOLOGOUS POLYPEPTIDES AND THERAPEUTIC USES 

THEREOF 

This application is a continuing application (filed under 35 U.S.C. § 120) which application claims priority 
to U.S. Provisional Application No.: 60/090696, filed on June 25, 1998; and to which U.S. Provisional 
Application claims priority under 35 U.S.C. §119; and also claims priority to International PCT Application Nos.: 
PCT/US99/ 12252, filed on June 2, 1999; PCT/US00/08439, filed on March 30, 2000; PGT/US00/23328, filed 
on August 24, 2000; and PCT/US0 1/06520, filed on February 28, 2001; to which International PCT Applications 
claim priority under 35 U.S.C. §120; and also claims priority to U.S. Patent Application Nos.: 09/380137, filed 
on August 25, 1999; 09/709238, filed on November 8, 2000; and 09/941992, filed on August 28, 2001; to which 
U.S. Patent Applications claim priority under 35 U.S.C. §120 the entire disclosures of which are hereby 
incorporated by reference. 

FIELD OF THE INVENTION 
The present invention relates generally to the identification and isolation of novel DNA and to the 
15 recombinant production of novel polypeptides having structural homology to the chemokine interleukin-8. 




BACKGROUND OF THE INVENTION 
Extracellular proteins play important roles in, among other things, the formation, differentiation and 
maintenance of multicellular organisms. The fate of many individual cells, e.g., proliferation, migration, 

20 differentiation, or interaction with other cells, is typically governed by information received from other cells 
and/or the immediate environment. This information is often transmitted by secreted polypeptides (for instance, 
mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which 
are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. These secreted 
polypeptides or signaling molecules normally pass through the cellular secretory pathway to reach their site of 

25 action in the extracellular environment. 

Secreted proteins have various industrial applications, including as pharmaceuticals, diagnostics, 
biosensors and bioreactors. Most protein drugs available at present, such as thrombolytic agents, interferons, 
interleukins, erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteins. Their 
receptors, which are membrane proteins, also have potential as therapeutic or diagnostic agents. 

30 Membrane-bound proteins and receptors can play important roles in, among other things, the formation, 

differentiation and maintenance of multicellular organisms. The fate of many individual cells, e.g. , proliferation, 
migration, differentiation, or interaction with other cells, is typically governed by information received from other 
cells and/or the immediate environment. This information is often transmitted by secreted polypeptides (for 
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instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and 
hormones) which are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. 
Such membrane-bound proteins and cell receptors include, but are not limited to, cytokine receptors, receptor 
kinases, receptor phosphatases, receptors involved in cell-cell interactions, and cellular adhesin molecules like 
selectins and integrins. For instance, transduction of signals that regulate cell growth and differentiation is 
regulated in part by phosphorylation of various cellular proteins. Protein tyrosine kinases, enzymes that catalyze 
that process, can also act as growth factor receptors. Examples include fibroblast growth factor receptor and 
nerve growth factor receptor. 

Similarly to secreted proteins, membrane-bound proteins and receptor molecules have various industrial 
applications, including as pharmaceutical and diagnostic agents. Receptor immunoadhesins, for instance, can be 
employed as therapeutic agents to block receptor-ligand interactions. The membrane-bound proteins can also be 
employed for screening of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction. 

Efforts are being undertaken by both industry and academia to identify new, native secreted proteins and 
native receptor or membrane-bound proteins. Many efforts are focused on the screening of mammalian 
recombinant DNA libraries to identify the coding sequences for novel secreted proteins. Examples of screening 
methods and techniques are described in the literature [see, for example, Klein et al. , Proc. Natl. Acad. Sci. , 
93:7108-7113 (1996); U.S. Patent No. 5,536,637)]. 

In this regard, the present invention relates to identifying novel secreted polypeptides of the interleukin-8 
(IL-8) family which have been shown to be related to immune-mediated and inflammatory disease. Immune 
related and inflammatory diseases are the manifestation or consequence of fairly complex, often multiple 
interconnected biological pathways which in normal physiology are critical to respond to insult or injury, initiate 
repair from insult or injury, and mount innate and acquired defense against foreign organisms. Disease or 
pathology occurs when these normal physiological pathways cause additional insult or injury either as directly 
related to the intensity of the response, as a consequence of abnormal regulation or excessive stimulation, as a 
reaction to self, or as a combination of these. 

Though the genesis of immune-related diseases often involves multi-step pathways and often multiple 
different biological systems/pathways, intervention at critical points in one or more of these pathways can have 
an ameliorative or therapeutic effect. Therapeutic intervention can occur by either antagonism of a detrimental 
process/pathway or stimulation of a beneficial process/pathway. 

Many immune related diseases are known and have been extensively studied. Such diseases include 
immune-mediated inflammatory diseases (such as rheumatoid arthritis, immune mediated renal disease, 
hepatobiliary diseases, inflammatory bowel disease (IBD), psoriasis, and asthma), non-immune-mediated 
inflammatory diseases, infectious diseases, immunodeficiency diseases, neoplasia, etc. 

Immune related diseases could be treated by suppressing the immune response. Using neutralizing 
antibodies that inhibit molecules having immune stimulatory activity would be beneficial in the treatment of 
immune-mediated and inflammatory diseases. Molecules which inhibit the immune response can be utilized 
(proteins directly or via the use of antibody agonists) to inhibit the immune response and thus ameliorate immune 
related disease. 



The present invention concerns the identification a novel chemokine which has structural homology to 
interleukin-8 (IL-8). The amino acid sequence between the two proteins is low, however they both have a CXC 
motif, which classifies IL-8 as a member of the CXC chemokine family. Interleukin-8 has been shown to play 
a role in the acute inflammatory response. This response is mediated primarily by TNF-a, IL-1 and IL-6. 
Localized effects include increased adherence of circulating white blood cells to vascular endothelial cells and their 
5 extravasation into tissue spaces. Both IL-1 and TNF-a induce increased expression of cell-adhesion molecules 
(CAMs) on endothelial cells. These two cytokines also induce production of interleukin-8 by macrophages and 
endothelial cells. IL-8 chemotactically attracts neutrophils and promotes their adherence to endothelial cells. 
Specifically, IL-8 chemoattracts monocytes and dendritic cells. Both cell types play an important role in the 
initiation of an immune response. 
10 Since the discovery 13 years ago of interleukin-8 (IL-8) as a potent neutrophil chemotactic factor, 

accumulating evidence has established it as a crucial mediator in neutrophil-dependent acute inflammation. In 
fact, leukocyte infiltration is a hallmark of inflammation. Numerous observations have demonstrated that various 
types of cells can produce a large amount of IL-8, either in response to various stimuli or constitutively, after 
malignant transformation (Mukaida, N. , International Journal of Hematology . 72(4) : 39 1-398 (2000)). The release 

15 of IL-8 is triggered by inflammatory signals from a variety of cells. The diversity in the cellular source indicates 
pleiotrophy of its functions. IL-8 plays a key role in host defense mechanism through its effects on neutrophil 
activation, but a continued presence of IL-8 in circulation in response to inflammatory conditions may lead to a 
variable degree of tissue damage. The presence of IL-8 in various pathophysiological conditions implies that 
blockade of its actions could be exploited for therapeutic purposes (Atta-ur-Rahman et al. , Current Pharmaceutical 

20 Design . 5(4) :241-253 (1999)). Recently, IL-8 has been shown to be an autocrine growth factor for human ovarian 
cancer. IL-8 appears to play a direct role in the progressive growth of ovarian cancer cells (Xu, L. and Fidler, 
I. J., Oncology Research . 12(2) :97-106 (2000)). In addition, increased levels of IL-8 have been found in 
bronchoalveolar lavage (BAL) fluids from patient's with acute respiratory distress syndrome (ARDS). The 
presence of anti-IL-8:IL-8 complexes in BAL fluids of patients with ARDS is an important prognostic indicator 

25 for the development and outcome of ARDS (Kurdowska, A. et al., American Journal of Respiratory & Critical 
Care Medicine . 163 (2) : 463 -468 (2001)). 

As discussed above, the class of molecules known as chemokines are a family of proinflammatory 
cytokines of low molecular mass (8-11 kDa) characterized by a structurally conserved motif and their ability to 
mediate leukocyte chemotaxis, thus playing an important role in leukocyte trafficking as well as function in 

30 regulation. It is now clear that these small cytokines play a role in a variety of homostatic and disease processes, 
including development, hematopoiesis, allergies, angiogenesis, and oncogenesis (see Broxmeyer, H. E. et al., L 
Exp. Med. . 170:1583 (1989); Cao, Y. et al., J. Exp. Med. . 182:2069 (1995); and Strieter, R. M. et al., J. Biol. 
Chem. . 270 :27348 (1995)). The majority of chemokines are expressed in response to some stimuli, but several 
are constitutively expressed (Wang, J. M. et al., J. Immunol. Methods . 220:1-17 (1998); Baggiolini, M., Annu. 

35 Rev. Immunol. . 15:675-705 (1997); and Gale, L. M., and McColl, S. R., Bioessavs . 21:17-28 (1999)). In 
addition, several CC chemokines, including RANTES, macrophage inflammatory protein (MIP) 4 - la, and MIP- 
1P, have been found to be capable of inhibiting HIV infection (Cocchi, F. et al., Science . 270:1811 (1995)). 




The chemokine family can be divided into four major subfamilies based on the positions of amino- 
terminal cysteine residues. In the CXC chemokines, the first two cysteines are separated by a non-conserved 
amino acid, while in the CC chemokine subfamily, these two cysteines are adjacent to each other. The C 
chemokine subfamily with the only member of lymphotactin lacks the second and fourth cysteines, which are 
conserved in the CXC and CC chemokines. The CX 3 C membrane-bound chemokines have 3 amino acids between 
5 the first two cysteines, a long mucin-like stalk, and a short transmembrane domain (Bazan, J. F. et al., Nature, 
385:640 (1997); Pan, Y. et al., Nature . 387:611 (1997)). In general, the CXC chemokines primarily recruit 
neutrophils, while the CC chemokines primarily attract monocytes and also lymphocytes, basophils, and/or 
eosinophils with variable selectivity. The C chemokine of lymphotactin seems to act specifically on T lymphocytes 
and NK cells (Kelner, G. S. et al., Science , 266:1395 (1994) and Kennedy, J. G. et al., J. Immunol. , 155:203 



Dendritic cells (DC) are the uniquely potent APCs involved in immune responses (Banchereau, J., and 
Steinman, R. M., Nature , 392 :245 (1998)). As adjuvants for Ag delivery, immature dendritic cells pick up Ags 
in the periphery and carry them to the T cell area in lymphoid organs to prime the immune responses, meanwhile 
undergoing maturation. Thus, chemokines play a vital role in dendritic trafficking, maturation, and function. 



The pattern of cytokines implicated in a DTH response suggest that activated T cells may be primarily of the Thl 
subset. IL-2 functions in an autocrine manner to amplify the population of cytokine-producing T cells. Among 
the cytokines produced by these cells are a number that serve to activate and attract macrophages to the site of Thl 
activation. IL-3 and GM-CSF induce localized hematopoiesis of the granulocyte-monocyte lineage. IFN-y and 

20 TNF-p (together with macrophage-derived TNF-a and IL-1) act on nearby endothelial cells, inducing a number 
of changes that facilitate extravasation of monocytes and other nonspecific inflammatory cells. Among the changes 
induced are increases in the expression of cellular-adhesion molecules including ICAMs, VCAMs, and ELAMSs; 
changes in the shape of the vascular endothelial cells to facilitate extravasation; and secretion of IL-8 and 
monocyte chemotactic factor. Circulating neutrophils and monocytes adhere to the adhesion molecules displayed 

25 on the vascular endothelial cells and extravasate into the tissue spaces. Neutrophils appear early in the reaction, 
whereas the monocyte infiltration occurs later. (See Immunology . Second Edition, Chapter 15, pgs. 363-364 
(copyright 1994) W.H. Freeman and Company publishers). 

Interest in this family of molecules has increased as it has become apparent that chemokines may 
contribute to a number of important medical conditions related to immune function: including rheumatoid arthritis, 

30 immune mediated renal diseases, hepatobiliary diseases, inflammatory bowel disease, psoriasis, asthma, multiple 
sclerosis, atherosclerosis, promotion of tumor growth, or degenerative joint disease. Given the potential of 
chemokine related molecules to occupy important roles in the control of immune function, there is an interest in 
the identification of other members of this family and the receptors that direct the actions of these molecules 
through particular target cell populations. In this respect, the present invention describes the cloning and 

35 characterization of novel proteins (designated herein as "PR0842" polypeptides) that are structurally similar to 
IL-8, and active variants thereof. 
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(1995)). 
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In addition, numerous cytokines play a role in generating a delayed-type hypersensitivity (DTH) response. 



4 



SUMMARY OF THE INVENTION 

A. Embodiments 

The present invention concerns compositions and methods useful for the diagnosis and treatment of 
immune related disease in mammals, including humans. The present invention is based on the identification of 
proteins (including agonist and antagonist antibodies) which either stimulate or inhibit the immune response in 
5 mammals. Immune related diseases can be treated by suppressing or enhancing the immune response. Molecules 
that enhance the immune response stimulate or potentiate the immune response to an antigen. Molecules which 
stimulate the immune response can be used therapeutically where enhancement of the immune response would be 
beneficial. Alternatively, molecules that suppress the immune response attenuate or reduce the immune response 
to an antigen (e.g. , neutralizing antibodies) can be used therapeutically where attenuation of the immune response 

10 would be beneficial (e.g., inflammation). Accordingly, the PR0842 polypeptides of the present invention and 
agonists and antagonists thereof are also useful to prepare medicines and medicaments for the treatment of 
immune-related and inflammatory diseases. In a specific aspect, such medicines and medicaments comprise a 
therapeutically effective amount of a PR0842 polypeptide, agonist or antagonist thereof with a pharmaceutically 
acceptable carrier. Preferably, the admixture is sterile. 

15 In a further embodiment, the invention concerns a method of identifying agonists of or antagonists to a 

PR0842 polypeptide which comprises contacting the PR0842 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said PR0842 polypeptide. Preferably, the PR0842 polypeptide is 
a native sequence PR0842 polypeptide. In a specific aspect, the PR0842 agonist or antagonist is an anti-PR0842 
antibody. 

20 In another embodiment, the invention concerns a composition of matter comprising a PR0842 polypeptide 

or an agonist or antagonist antibody which binds the polypeptide in admixture with a carrier or excipient. In one 
aspect, the composition comprises a therapeutically effective amount of the polypeptide or antibody. In another 
aspect, when the composition comprises an immune stimulating molecule, the composition is useful for: (a) 
enhancing infiltration of inflammatory cells into a tissue of a mammal in need thereof, (b) stimulating or enhancing 

25 an immune response in a mammal in need thereof, (c) increasing the proliferation of T-lymphocytes in a mammal 
in need thereof in response to an antigen, (d) stimulating the activity of T-lymphocytes or (e) increasing the 
vascular permeability. In a further aspect, when the composition comprises an immune inhibiting molecule, the 
composition is useful for: (a) decreasing infiltration of inflammatory cells into a tissue of a mammal in need 
thereof, (b) inhibiting or reducing an immune response in a mammal in need thereof, (c) decreasing the activity 

30 of T-lymphocytes or (d) decreasing the proliferation of T-lymphocytes in a mammal in need thereof in response 
to an antigen. In another aspect, the composition comprises a further active ingredient, which may, for example, 
be a further antibody or a cytotoxic or chemotherapeutic agent. Preferably, the composition is sterile. 

In another embodiment, the invention concerns a method of treating an immune related disorder in a 
mammal in need thereof, comprising administering to the mammal a therapeutically effective amount of a PR0842 

35 polypeptide, an agonist thereof, or an antagonist thereto. In a preferred aspect, the immune related disorder is 
selected form the group consisting of: systemic lupus erythematosis, rheumatoid arthritis, osteoarthritis, juvenile 
chronic arthritis, spondyloarthropathies, systemic sclerosis, idiopathic inflammatory myopathies, Sjogren's 



syndrome, systemic vasculitis, sarcoidosis, autoimmune hemolytic anemia, autoimmune thrombocytopenia, 
thyroiditis, diabetes mellitus, immune-mediated renal disease, demyelinating diseases of the central and peripheral 
nervous systems such as multiple sclerosis, idiopathic demyelinating polyneuropathy or Guillain-Barre syndrome, 
and chronic inflammatory demyelinating polyneuropathy, hepatobiliary diseases such as infectious, autoimmune 
chronic active hepatitis, primary biliary cirrhosis, granulomatous hepatitis, and sclerosing cholangitis, 
inflammatory bowel disease, gluten-sensitive enteropathy, and Whipple's disease, autoimmune or 
immune-mediated skin diseases including bullous skin diseases, erythema multiforme and contact dermatitis, 
psoriasis, allergic diseases such as asthma, allergic rhinitis, atopic dermatitis, food hypersensitivity and urticaria, 
immunologic diseases of the lung such as eosinophilic pneumonia, idiopathic pulmonary fibrosis and 
hypersensitivity pneumonitis, transplantation associated diseases including graft rejection and graft 
-versus-host-disease . 

In another embodiment, the invention provides an antibody which specifically binds to any of the above 
or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized antibody, 
antibody fragment or single-chain antibody. In one aspect, the present invention concerns an isolated antibody 
which binds a PR0842 polypeptide. In another aspect, the antibody mimics the activity of a PR0842 polypeptide 
(an agonist antibody) or conversely the antibody inhibits or neutralizes the activity of a PR0842 polypeptide (an 
antagonist antibody). In another aspect, the antibody is a monoclonal antibody, which preferably has nonhuman 
complementarity determining region (CDR) residues and human framework region (FR) residues. The antibody 
may be labeled and may be immobilized on a solid support. In a further aspect, the antibody is an antibody 
fragment, a monoclonal antibody, a single-chain antibody, or an anti-idiotypic antibody. 

In yet another embodiment, the present invention provides a composition comprising an anti-PR0842 
antibody in admixture with a pharmaceutically acceptable carrier. In one aspect, the composition comprises a 
therapeutically effective amount of the antibody. Preferably, the composition is sterile. The composition may 
be administered in the form of a liquid pharmaceutical formulation, which may be preserved to achieve extended 
storage stability. Alternatively, the antibody is a monoclonal antibody, an antibody fragment, a humanized 
antibody, or a single-chain antibody. 

In a further embodiment, the invention concerns an article of manufacture, comprising: 

(a) a composition of matter comprising a PR0842 polypeptide or agonist, antagonist, or an antibody that 
specifically binds to said polypeptide thereof; 

(b) a container containing said composition; and 

(c) a label affixed to said container, or a package insert included in said container referring to the use of said 
PR0842 polypeptide or agonist or antagonist thereof in the treatment of an immune related disease. The 
composition may comprise a therapeutically effective amount of the PR0842 polypeptide or the agonist or 
antagonist thereof. 

In yet another embodiment, the present invention concerns a method of diagnosing an immune related 
disease in a mammal, comprising detecting the level of expression of a gene encoding a PR0842 polypeptide (a) 
in a test sample of tissue cells obtained from the mammal, and (b) in a control sample of known normal tissue cells 
of the same cell type, wherein a higher or lower expression level in the test sample as compared to the control 



sample indicates the presence of immune related disease in the mammal from which the test tissue cells were 
obtained. 

In another embodiment, the present invention concerns a method of diagnosing an immune disease in a 
mammal, comprising (a) contacting an anti-PR0842 antibody with a test sample of tissue cells obtained from the 
mammal, and (b) detecting the formation of a complex between the antibody and a PR0842 polypeptide, in the 
test sample; wherein the formation of said complex is indicative of the presence or absence of said disease. The 
detection may be qualitative or quantitative, and may be performed in comparison with monitoring the complex 
formation in a control sample of known normal tissue cells of the same cell type. A larger quantity of complexes 
formed in the test sample indicates the presence or absence of an immune disease in the mammal from which the 
test tissue cells were obtained. The antibody preferably carries a detectable label. Complex formation can be 
monitored, for example, by light microscopy, flow cytometry, fluorimetry, or other techniques known in the art. 
The test sample is usually obtained from an individual suspected of having a deficiency or abnormality of the 
immune system. 

In another embodiment, the invention provides a method for determining the presence of a PR0842 
polypeptide in a sample comprising exposing a test sample of cells suspected of containing the PR0842 polypeptide 
to an anti-PR0842 antibody and determining the binding of said antibody to said cell sample. In a specific aspect, 
the sample comprises a cell suspected of containing the PR0842 polypeptide and the antibody binds to the cell. 
The antibody is preferably detectably labeled and/or bound to a solid support. 

In another embodiment, the present invention concerns an immune-related disease diagnostic kit, 
comprising an anti-PR0842 antibody and a carrier in suitable packaging. The kit preferably contains instructions 
for using the antibody to detect the presence of the PR0842 polypeptide. Preferably the carrier is 
pharmaceutically acceptable. 

In another embodiment, the present invention concerns a diagnostic kit, containing an anti-PR0842 
antibody in suitable packaging. The kit preferably contains instructions for using the antibody to detect the 
PR0842 polypeptide. 

In another embodiment, the invention provides a method of diagnosing an immune-related disease in a 
mammal which comprises detecting the presence or absence or a PR0842 polypeptide in a test sample of tissue 
cells obtained from said mammal, wherein the presence or absence of the PR0842 polypeptide in said test sample 
is indicative of the presence of an immune-related disease in said mammal. 

In another embodiment, the present invention concerns a method for identifying an agonist of a PR0842 
polypeptide comprising: (a) contacting cells and a test compound to be screened under conditions suitable for the 
induction of a cellular response normally induced by a PR0842 polypeptide; and (b) determining the induction 
of said cellular response to determine if the test compound is an effective agonist, wherein the induction of said 
cellular response is indicative of said test compound being an effective agonist. 

In another embodiment, the invention concerns a method for identifying a compound capable of inhibiting 
the activity of a PR0842 polypeptide comprising contacting a candidate compound with a PR0842 polypeptide 
under conditions and for a time sufficient to allow these two components to interact and determining whether the 
activity of the PR0842 polypeptide is inhibited. In a specific aspect, either the candidate compound or the 



PR0842 polypeptide is immobilized on a solid support. In another aspect, the non-immobilized component carries 
a detectable label. In a preferred aspect, this method comprises the steps of: (a) contacting cells and a test 
compound to be screened in the presence of a PR0842 polypeptide under conditions suitable for the induction of 
a cellular response normally induced by a PR0842 polypeptide; and (b) determining the induction of said cellular 
response to determine if the test compound is an effective antagonist. 

In another embodiment, the invention provides a method for identifying a compound that inhibits the 
expression of a PR0842 polypeptide in cells that normally express the polypeptide, wherein the method comprises 
contacting the cells with a test compound and determining whether the expression of the PR0842 polypeptide is 
inhibited. In a preferred aspect, this method comprises the steps of: (a) contacting cells and a test compound to 
be screened under conditions suitable for allowing expression of the PR0842 polypeptide; and (b) determining 
the inhibition of expression of said polypeptide. 

In yet another embodiment, the present invention concerns a method for treating an immune-related 
disorder in a mammal that suffers therefrom comprising administering to the mammal a nucleic acid molecule that 
codes for either (a) a PR0842 polypeptide, (b) an agonist of a PR0842 polypeptide or (c) an antagonist of a 
PR0842 polypeptide, wherein said agonist or antagonist may be an anti-PR0842 antibody. In a preferred 
embodiment, the mammal is human. In another preferred embodiment, the nucleic acid is administered via ex 
vivo gene therapy. In a further preferred embodiment, the nucleic acid is comprised within a vector, more 
preferably an adenoviral, adeno-associated viral, lentiviral or retroviral vector. 

In yet another aspect, the invention provides a recombinant viral particle comprising a viral vector 
consisting essentially of a promoter, nucleic acid encoding (a) a PR0842 polypeptide, (b) an agonist polypeptide 
of a PR0842 polypeptide, or (c) an antagonist polypeptide of a PR0842 polypeptide, and a signal sequence for 
cellular secretion of the polypeptide, wherein the viral vector is in association with viral structural proteins. 
Preferably, the signal sequence is from a mammal, such as from a native PR0842 polypeptide. 

In a still further embodiment, the invention concerns an ex vivo producer cell comprising a nucleic acid 
construct that expresses retroviral structural proteins and also comprises a retroviral vector consisting essentially 
of a promoter, nucleic acid encoding (a) a PR0842 polypeptide, (b) an agonist polypeptide of a PR0842 
polypeptide or (c) an antagonist polypeptide of a PR0842 polypeptide, and a signal sequence for cellular secretion 
of the polypeptide, wherein said producer cell packages the retroviral vector in association with the structural 
proteins to produce recombinant retroviral particles. 

In a still further embodiment, the invention provides a method for enhancing the infiltration of 
inflammatory cells from the vasculature into a tissue of a mammal comprising administering to said mammal (a) 
a PR0842 polypeptide, (b) an agonist of a PR0842 polypeptide, or (c) an antagonist of a PR0842 polypeptide, 
wherein the infiltration of inflammatory cells from the vasculature in the mammal is enhanced. 

In a still further embodiment, the invention provides a method for decreasing the infiltration of 
inflammatory cells from the vasculature into a tissue of a mammal comprising administering to said mammal (a) 
a PR0842 polypeptide, (b) an agonist of a PR0842 polypeptide, or (c) an antagonist of a PR0842 polypeptide, 
wherein the infiltration of inflammatory cells from the vasculature in the mammal is decreased. 

In a still further embodiment, the invention provides a method of increasing the activity of T-lymphocytes 

8 



in a mammal comprising administering to said mammal (a) a PR0842 polypeptide, (b) an agonist of a PR0842 
polypeptide, or (c) an antagonist of a PR0842 polypeptide, wherein the activity of T-lymphocytes in the mammal 
is increased. 

In a still further embodiment, the invention provides a method of decreasing the activity of T-lymphocytes 
in a mammal comprising administering to said mammal (a) a PR0842 polypeptide, (b) an agonist of a PR0842 
5 polypeptide, or (c) an antagonist of a PR0842 polypeptide, wherein the activity of T-lymphocytes in the mammal 
is decreased. 

In a still further embodiment, the invention provides a method of increasing the proliferation of 
T-lymphocytes in a mammal comprising administering to said mammal (a) a PR0842 polypeptide, (b) an agonist 
of a PR0842 polypeptide, or (c) an antagonist of a PR0842 polypeptide, wherein the proliferation of 
10 T-lymphocytes in the mammal is increased. 

In a still further embodiment, the invention provides a method of decreasing the proliferation of 
T-lymphocytes in a mammal comprising administering to said mammal (a) a PR0842 polypeptide, (b) an agonist 
of a PR0842 polypeptide, or (c) an antagonist of a PR0842 polypeptide, wherein the proliferation of 
T-lymphocytes in the mammal is decreased. 
15 In still a further embodiment, the invention relates to a kit comprising a composition comprising a 

PR0842, or an agonist or antagonist thereof, in admixture with a pharmaceutical^ acceptable carrier; a container 
containing said composition; and a label affixed to said container, referring to the use of said composition, in the 
treatment of a degenerative cartilaginous disorder. 

20 B. Additional Embodiments 

In other embodiments of the present invention, the invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence that encodes a PR0842 polypeptide. 

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 
80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively 

25 at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity, 
alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 
sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% 
nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at least 
about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, 

30 alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid 
sequence identity, alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94% 
nucleic acid sequence identity, alternatively at least about 95 % nucleic acid sequence identity, alternatively at least 
about 96% nucleic acid sequence identity, alternatively at least about 97% nucleic acid sequence identity, 
alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid 

35 sequence identity to (a) a DNA molecule encoding a PR0842 polypeptide having a full-length amino acid sequence 
as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain 
of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other specifically 




defined fragment of the full-length amino acid sequence as disclosed herein, or (b) the complement of the DNA 
molecule of (a). 

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 
80% nucleic acid sequence identity, alternatively at least about 81 % nucleic acid sequence identity, alternatively 
at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity, 
5 alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 
sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% 
nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at least 
about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, 
alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid 

10 sequence identity, alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94% 
nucleic acid sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at least 
about 96% nucleic acid sequence identity, alternatively at least about 97% nucleic acid sequence identity, 
alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid 
sequence identity to (a) a DNA molecule comprising the coding sequence of a full-length PR0842 polypeptide 

15 cDNA as disclosed herein, the coding sequence of a PR0842 polypeptide lacking the signal peptide as disclosed 
herein, the coding sequence of an extracellular domain of a transmembrane PR0842 polypeptide, with or without 
the signal peptide, as disclosed herein or the coding sequence of any other specifically defined fragment of the 
full-length amino acid sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 

20 sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid 
sequence identity, alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83% 
nucleic acid sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least 
about 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, 
alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 

25 sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% 
nucleic acid sequence identity, alternatively at least about 91 % nucleic acid sequence identity, alternatively at least 
about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity, 
alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid 
sequence identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 97% 

30 nucleic acid sequence identity, alternatively at least about 98% nucleic acid sequence identity and alternatively at 
least about 99 % nucleic acid sequence identity to (a) a DNA molecule that encodes the same mature polypeptide 
encoded by any of the human protein cDNAs deposited with the ATCC as disclosed herein, or (b) the complement 
of the DNA molecule of (a). 

Another aspect of the present invention provides an isolated nucleic acid molecule comprising a nucleotide 

35 sequence encoding a PR0842 polypeptide which is either transmembrane domain-deleted or transmembrane 
domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane 
domain(s) of such polypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein 
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described PR0842 polypeptides are contemplated. 

Another embodiment is directed to fragments of a PR0842 polypeptide coding sequence, or the 
complement thereof, that may find use as, for example, hybridization probes, for encoding fragments of a PR0842 
polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PR0842 antibody or 
as antisense oligonucleotide probes. Such nucleic acid fragments are usually at least about 20 nucleotides in 
5 length, alternatively at least about 30 nucleotides in length, alternatively at least about 40 nucleotides in length, 
alternatively at least about 50 nucleotides in length, alternatively at least about 60 nucleotides in length, 
alternatively at least about 70 nucleotides in length, alternatively at least about 80 nucleotides in length, 
alternatively at least about 90 nucleotides in length, alternatively at least about 100 nucleotides in length, 
alternatively at least about 110 nucleotides in length, alternatively at least about 120 nucleotides in length, 

10 alternatively at least about 130 nucleotides in length, alternatively at least about 140 nucleotides in length, 
alternatively at least about 150 nucleotides in length, alternatively at least about 160 nucleotides in length, 
alternatively at least about 170 nucleotides in length, alternatively at least about 180 nucleotides in length, 
alternatively at least about 190 nucleotides in length, alternatively at least about 200 nucleotides in length, 
alternatively at least about 250 nucleotides in length, alternatively at least about 300 nucleotides in length, 

15 alternatively at least about 350 nucleotides in length, alternatively at least about 400 nucleotides in length, 
alternatively at least about 450 nucleotides in length, alternatively at least about 500 nucleotides in length, 
alternatively at least about 600 nucleotides in length, alternatively at least about 700 nucleotides in length, 
alternatively at least about 800 nucleotides in length, alternatively at least about 900 nucleotides in length and 
alternatively at least about 1000 nucleotides in length, wherein in this context the term "about" means the 

20 referenced nucleotide sequence length plus or minus 10% of that referenced length. It is noted that novel 
fragments of a PR0842 polypeptide-encoding nucleotide sequence may be determined in a routine manner by 
aligning the PR0842 polypeptide-encoding nucleotide sequence with other known nucleotide sequences using any 
of a number of well known sequence alignment programs and determining which PR0842 polypeptide-encoding 
nucleotide sequence fragment(s) are novel. All of such PR0842 polypeptide-encoding nucleotide sequences are 

25 contemplated herein. Also contemplated are the PR0842 polypeptide fragments encoded by these nucleotide 
molecule fragments, preferably those PR0842 polypeptide fragments that comprise a binding site for an 
anti-PR0842 antibody. 

In another embodiment, the invention provides an isolated PR0842 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove identified. 

30 In a certain aspect, the invention concerns an isolated PR0842 polypeptide, comprising an amino acid 

sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid 
sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 83% 
amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least 
about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, 

35 alternatively at least about 87 % amino acid sequence identity, alternatively at least about 88 % amino acid sequence 
identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about 90% amino acid 
sequence identity, alternatively at least about 91 % amino acid sequence identity, alternatively at least about 92% 
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amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, alternatively at least 
about 94% amino acid sequence identity, alternatively at least about 95% amino acid sequence identity, 
alternatively at least about 96% amino acid sequence identity, alternatively at least about 97% amino acid sequence 
identity, alternatively at least about 98% amino acid sequence identity and alternatively at least about 99% amino 
acid sequence identity to a PR0842 polypeptide having a full-length amino acid sequence as disclosed herein, an 
5 amino acid sequence lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane 
protein, with or without the signal peptide, as disclosed herein or any other specifically defined fragment of the 
full-length amino acid sequence as disclosed herein. 

In a further aspect, the invention concerns an isolated PR0842 polypeptide comprising an amino acid 
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid 

10 sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 83% 
amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least 
about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, 
alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid sequence 
identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about 90% amino acid 

15 sequence identity, alternatively at least about 91 % amino acid sequence identity, alternatively at least about 92% 
amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, alternatively at least 
about 94% amino acid sequence identity, alternatively at least about 95% amino acid sequence identity, 
alternatively at least about 96% amino acid sequence identity, alternatively at least about 97% amino acid sequence 
identity, alternatively at least about 98% amino acid sequence identity and alternatively at least about 99% amino 

20 acid sequence identity to an amino acid sequence encoded by any of the human protein cDNAs deposited with the 
ATCC as disclosed herein. 

In a further aspect, the invention concerns an isolated PR0842 polypeptide comprising an amino acid 
sequence scoring at least about 80% positives, alternatively at least about 81 % positives, alternatively at least about 
82% positives, alternatively at least about 83% positives, alternatively at least about 84% positives, alternatively 

25 at least about 85% positives, alternatively at least about 86% positives, alternatively at least about 87% positives, 
alternatively at least about 88% positives, alternatively at least about 89% positives, alternatively at least about 
90% positives, alternatively at least about 91% positives, alternatively at least about 92% positives, alternatively 
at least about 93% positives, alternatively at least about 94% positives, alternatively at least about 95% positives, 
alternatively at least about 96% positives, alternatively at least about 97% positives, alternatively at least about 

30 98% positives and alternatively at least about 99% positives when compared with the amino acid sequence of a 
PR0842 polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking 
the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or without the 
signal peptide, as disclosed herein or any other specifically defined fragment of the full-length amino acid sequence 
as disclosed herein. 

35 In a specific aspect, the invention provides an isolated PR0842 polypeptide without the N-terminal signal 

sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino acid 
sequence as hereinbefore described. Processes for producing the same are also herein described, wherein those 
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processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding nucleic 
acid molecule under conditions suitable for expression of the PR0842 polypeptide and recovering the PR0842 
polypeptide from the cell culture. 

Another aspect of the invention provides an isolated PR0842 polypeptide which is either transmembrane 
domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PR0842 polypeptide 
and recovering the PR0842 polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PR0842 
polypeptide as defined herein. In a particular embodiment, the agonist or antagonist is an anti-PR0842 antibody 
or a small molecule. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
PR0842 polypeptide which comprise contacting the PR0842 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said PR0842 polypeptide. Preferably, the PR0842 polypeptide is 
a native PR0842 polypeptide. 

In a still further embodiment, the invention concerns a composition of matter comprising a PR0842 
polypeptide, or an agonist or antagonist of a PR0842 polypeptide as herein described, or an anti-PR0842 
antibody, in combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier. 

Another embodiment of the present invention is directed to the use of a PR0842 polypeptide, or an 
agonist or antagonist thereof as hereinbefore described, or an anti-PR0842 antibody, for the preparation of a 
medicament useful in the treatment of a condition which is responsive to the PR0842 polypeptide, an agonist or 
antagonist thereof or an anti-PR0842 antibody. 

In additional embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described polypeptides. Host cell comprising any such vector are also provided. By 
way of example, the host cells may be CHO cells, E. coli, yeast, or Baculovirus-infected insect cells. A process 
for producing any of the herein described polypeptides is further provided and comprises culturing host cells under 
conditions suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell 
culture. 

In other embodiments, the invention provides chimeric molecules comprising any of the herein described 
polypeptides fused to a heterologous polypeptide or amino acid sequence. Example of such chimeric molecules 
comprise any of the herein described polypeptides fused to an epitope tag sequence or a Fc region of an 
immunoglobulin. 

In yet another embodiment, the invention provides an antibody which specifically binds to any of the 
above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized antibody, 
antibody fragment or single-chain antibody. 

In yet other embodiments, the invention provides oligonucleotide probes useful for isolating genomic and 
cDNA nucleotide sequences or as antisense probes, wherein those probes may be derived from any of the above 
or below described nucleotide sequences. 

13 




BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a nucleotide sequence (SEQ ID NO: 1) of a native sequence PR0842 cDNA, wherein SEQ 
ID NO:l is a clone designated herein as "DNA56855-1447": 

Figure 2 shows the amino acid sequence (SEQ ID NO: 2) derived from the coding sequence of SEQ ID 
NO: 1 shown in Figure 1 . 

5 Figure 3 shows the tissue expression pattern of DNA56855 (CK27) as demonstrated by hybridization to 

a human multiple tissue expression array. Significant expression was observed in lung (A8), stomach (B5), and 
trachea (H7). In ovary, prostate, colon and fetal lung weak expression of CK27 was detected. 

Figure 4 shows the tissue expression pattern of DNA56855 (CK27) as shown by hybridization of a 150 
bp probe to a human multiple tissue Northern Blot. Significant expression was observed in the stomach, thyroid, 
10 lymph node and trachea. 

Figure 5 shows the tissue expression pattern of murine CK27 in a mouse RNA Blot. Significant 
expression was observed in the lung (A4), thyroid (C2), submaxillary gland (C4) and uterus (D5). 

Figure 6 demonstrates that PR0842 (CK27) specifically attracts dendritic cells (CDllc + cells) as 
compared with CD3 + cells or CD16 + cells. 
15 Figure 7 shows that different lots of PR0842 (CK27) show similar chemoattractant activity. 

Figure 8 demonstrates that heat-treatment of PR0842 (CK27) reduces the chemoattraction of dendritic 
cells (CDllc + cells) and monocytic cells (CD14 + cells) to PR0842 (CK27). 

Figure 9 demonstrates the formation of ovarian cysts in human DNA56855 (CK27) transgenic mice. The 
top left image corresponds to the control (normal ovary in non-transgenic mice); the bottom left image shows 
20 cystic degeneration and hemosiderin in CK27 transgenic mice; the top right image shows the ovary with 
hemosiderin changes which extend into adjacent adipose tissue in CK27 transgenic mice; and the bottom right high 
power image shows the presence of pigment laden macrophages. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

25 I. Definitions 

The terms "PRO polypeptide" and "PRO" as used herein and when immediately followed by a numerical 
designation refer to various polypeptides, wherein the complete designation (i.e. , PRO/number) refers to specific 
polypeptide sequences as described herein. The terms "PRO/number polypeptide" and "PRO/number" wherein 
the term "number" is provided as an actual numerical designation as used herein encompass native sequence 

30 polypeptides and polypeptide variants (which are further defined herein). The PRO polypeptides described herein 
may be isolated from a variety of sources, such as from human tissue types or from another source, or prepared 
by recombinant or synthetic methods. The term "PRO polypeptide" refers to each individual PRO/number 
polypeptide disclosed herein. All disclosures in this specification which refer to the "PRO polypeptide" refer to 
each of the polypeptides individually as well as jointly. For example, descriptions of the preparation of, 

35 purification of, derivation of, formation of antibodies to or against, administration of, compositions containing, 
treatment of a disease with, etc., pertain to each polypeptide of the invention individually. The term "PRO 
polypeptide" also includes variants of the PRO/number polypeptides disclosed herein. 

14 



A "native sequence PR0842 polypeptide" comprises a polypeptide having the same amino acid sequence 
as the corresponding PR0842 polypeptide derived from nature. Such native sequence PR0842 polypeptides can 
be isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence 
PR0842 polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific 
PR0842 polypeptide {e.g., an extracellular domain sequence), naturally-occurring variant forms {e.g., 
alternatively spliced forms) and naturally-occurring allelic variants of the polypeptide. In various embodiments 
of the invention, the native sequence PR0842 polypeptides disclosed herein are mature or full-length native 
sequence polypeptides comprising the full-length amino acid sequences shown in the accompanying figures. Start 
and stop codons are shown in bold font and underlined in the figures. However, while the PR0842 polypeptide 
disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino 
acid position 1 in the figures, it is conceivable and possible that other methionine residues located either upstream 
or downstream from the amino acid position 1 in the figures may be employed as the starting amino acid residue 
for the PR0842 polypeptides. 

The PR0842 polypeptide "extracellular domain" or "ECD" refers to a form of the PR0842 polypeptide 
which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PR0842 polypeptide ECD 
will have less than 1 % of such transmembrane and/or cytoplasmic domains and preferably, will have less than 
0.5% of such domains. It will be understood that any transmembrane domains identified for the PR0842 
polypeptides of the present invention are identified pursuant to criteria routinely employed in the art for identifying 
that type of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by 
no more than about 5 amino acids at either end of the domain as initially identified herein. Optionally, therefore, 
an extracellular domain of a PR0842 polypeptide may contain from about 5 or fewer amino acids on either side 
of the transmembrane domain/extracellular domain boundary as identified in the Examples or specification and 
such polypeptides, with or without the associated signal peptide, and nucleic acid encoding them, are contemplated 
by the present invention. 

The approximate location of the "signal peptides " of the various PR0842 polypeptides disclosed herein 
are shown in the present specification and/or the accompanying figures. It is noted, however, that the C-terminal 
boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on either side of the 



signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary of the signal 
peptide may be identified pursuant to criteria routinely employed in the art for identifying that type of amino acid 
sequence element {e.g., Nielsen et al., Prot. Eng. . 10:1-6 (1997) and von Heinje et al. 9 NucL Acids. Res. . 
14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal sequence from 
a secreted polypeptide is not entirely uniform, resulting in more than one secreted species. These mature 
polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on either side of the 
C-terminal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, are 
contemplated by the present invention. 

"PR0842 polypeptide variant" means an active PR0842 polypeptide as defined above or below having 
at least about 80% amino acid sequence identity with a full-length native sequence PR0842 polypeptide sequence 
as disclosed herein, a PR0842 polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular 
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domain of a PR0842 polypeptide, with or without the signal peptide, as disclosed herein or any other fragment 
of a full-length PR0842 polypeptide sequence as disclosed herein. Such PR0842 polypeptide variants include, 
for instance, PR0842 polypeptides wherein one or more amino acid residues are added, or deleted, at the - or 
C-terminus of the full-length native amino acid sequence. Ordinarily, a PR0842 polypeptide variant will have 
at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid sequence identity, 
5 alternatively at least about 82% amino acid sequence identity, alternatively at least about 83 % amino acid sequence 
identity, alternatively at least about 84% amino acid sequence identity, alternatively at least about 85% amino acid 
sequence identity, alternatively at least about 86% amino acid sequence identity, alternatively at least about 87% 
amino acid sequence identity, alternatively at least about 88% amino acid sequence identity, alternatively at least 
about 89% amino acid sequence identity, alternatively at least about 90% amino acid sequence identity, 

10 alternatively at least about 91 % amino acid sequence identity, alternatively at least about 92% amino acid sequence 
identity, alternatively at least about 93 % amino acid sequence identity, alternatively at least about 94% amino acid 
sequence identity, alternatively at least about 95% amino acid sequence identity, alternatively at least about 96% 
amino acid sequence identity, alternatively at least about 97% amino acid sequence identity, alternatively at least 
about 98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity to a 

15 full-length native sequence PR0842 polypeptide sequence as disclosed herein, a PR0842 polypeptide sequence 
lacking the signal peptide as disclosed herein, an extracellular domain of a PR0842 polypeptide, with or without 
the signal peptide, as disclosed herein or any other specifically defined fragment of a full-length PR0842 
polypeptide sequence as disclosed herein. Ordinarily, PR0842 variant polypeptides are at least about 10 amino 
acids in length, alternatively at least about 20 amino acids in length, alternatively at least about 30 amino acids 

20 in length, alternatively at least about 40 amino acids in length, alternatively at least about 50 amino acids in length, 
alternatively at least about 60 amino acids in length, alternatively at least about 70 amino acids in length, 
alternatively at least about 80 amino acids in length, alternatively at least about 90 amino acids in length, 
alternatively at least about 100 amino acids in length, alternatively at least about 150 amino acids in length, 
alternatively at least about 200 amino acids in length, alternatively at least about 300 amino acids in length, or 



"Percent (%) amino acid sequence identity" with respect to the PR0842 polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino 
acid residues in the specific PR0842 polypeptide sequence, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions 



be achieved in various ways that are within the skill in the art, for instance, using publicly available computer 
software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the art can 
determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal 
alignment over the full length of the sequences being compared. For purposes herein, however, % amino acid 
35 sequence identity values are generated using the sequence comparison computer program ALIGN-2, wherein the 
complete source code for the ALIGN-2 program is provided in Table 1 below. The ALIGN-2 sequence 
comparison computer program was authored by Genentech, Inc. and the source code shown in Table 1 below has 



25 



more. 



30 



as part of the sequence identity. Alignment for purposes of determining percent amino acid sequence identity can 
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been filed with user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered 
under U.S. Copyright Registration No. TXU5 10087. The ALIGN-2 program is publicly available through 
Genentech, Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 
below. The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX 
V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary. 
5 In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid 

sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can 
alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence 
identity to, with, or against a given amino acid sequence B) is calculated as follows: 

10 100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. 
It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 

15 sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of 
B to A. As examples of % amino acid sequence identity calculations using this method, Tables 2 and 3 
demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated 
"Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO" represents the amino acid 
sequence of a hypothetical PRO polypeptide of interest, "Comparison Protein" represents the amino acid sequence 

20 of a polypeptide against which the "PRO" polypeptide of interest is being compared, and "X, "Y" and "Z" each 
represent different hypothetical amino acid residues. 

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are obtained 
as described in the immediately preceding paragraph using the ALIGN-2 computer program. However, % amino 
acid sequence identity values may also be obtained as described below by using the WU-BLAST-2 computer 

25 program (Altschul et al., Methods in Enzvmologv 266:460-480 (1996)). Most of the WU-BLAST-2 search 
parameters are set to the default values. Those not set to default values, i.e. , the adjustable parameters, are set 
with the following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11, and scoring 
matrix = BLOSUM62. When WU-BLAST-2 is employed, a % amino acid sequence identity value is determined 
by dividing (a) the number of matching identical amino acid residues between the amino acid sequence of the PRO 

30 polypeptide of interest having a sequence derived from the native PRO polypeptide and the comparison amino acid 
sequence of interest (i.e., the sequence against which the PRO polypeptide of interest is being compared which 
may be a PRO variant polypeptide) as determined by WU-BLAST-2 by (b) the total number of amino acid residues 
of the PRO polypeptide of interest. For example, in the statement "a polypeptide comprising an the amino acid 
sequence A which has or having at least 80% amino acid sequence identity to the amino acid sequence B", the 

35 amino acid sequence A is the comparison amino acid sequence of interest and the amino acid sequence B is the 
amino acid sequence of the PRO polypeptide of interest. 

Percent amino acid sequence identity may also be determined using the sequence comparison program 
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NCBI-BLAST2 (Altschul et ah, Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence 
comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained from the 
National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of those 
search parameters are set to default values including, for example, unmask = yes, strand = all, expected 
occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 0.01, constant for multi-pass 
= 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % amino acid 
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can 
alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence 
identity to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 



where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid residues 
15 in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino 
acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity 
of B to A. 

"PR0842 variant polynucleotide" or "PR0842 variant nucleic acid sequence" means a nucleic acid 
molecule which encodes an active PR0842 polypeptide as defined below and which has at least about 80% nucleic 

20 acid sequence identity with a nucleotide acid sequence encoding a full-length native sequence PR0842 polypeptide 
sequence as disclosed herein, a full-length native sequence PR0842 polypeptide sequence lacking the signal peptide 
as disclosed herein, an extracellular domain of a PR0842 polypeptide, with or without the signal peptide, as 
disclosed herein or any other fragment of a full-length PR0842 polypeptide sequence as disclosed herein. 
Ordinarily, a PR0842 variant polynucleotide will have at least about 80% nucleic acid sequence identity, 

25 alternatively at least about 81% nucleic acid sequence identity, alternatively at least about 82% nucleic acid 
sequence identity, alternatively at least about 83% nucleic acid sequence identity, alternatively at least about 84% 
nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at least 
about 86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid sequence identity, 
alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid 

30 sequence identity, alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91 % 
nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, alternatively at least 
about 93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid sequence identity, 
alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid 
sequence identity, alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98% 

35 nucleic acid sequence identity and alternatively at least about 99% nucleic acid sequence identity with a nucleic 
acid sequence encoding a full-length native sequence PR0842 polypeptide sequence as disclosed herein, a full- 
length native sequence PR0842 polypeptide sequence lacking the signal peptide as disclosed herein, an 
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extracellular domain of a PR0842 polypeptide, with or without the signal sequence, as disclosed herein or any 
other fragment of a full-length PR0842 polypeptide sequence as disclosed herein. Variants do not encompass the 
native nucleotide sequence. 

Ordinarily, PR0842 variant polynucleotides are at least about 30 nucleotides in length, alternatively at 
least about 60 nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about 
5 120 nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least about 180 
nucleotides in length, alternatively at least about 210 nucleotides in length, alternatively at least about 240 
nucleotides in length, alternatively at least about 270 nucleotides in length, alternatively at least about 300 
nucleotides in length, alternatively at least about 450 nucleotides in length, alternatively at least about 600 
nucleotides in length, alternatively at least about 900 nucleotides in length, or more. 

10 "Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences 

identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent 
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, 

15 using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) 
software. For purposes herein, however, % nucleic acid sequence identity values are generated using the sequence 
comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided 
in Table 1 below. The ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and 
the source code shown in Table 1 below has been filed with user documentation in the U.S. Copyright Office, 

20 Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. TXU5 10087. The 
ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be 
compiled from the source code provided in Table 1 below. The ALIGN-2 program should be compiled for use 
on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by 
the ALIGN-2 program and do not vary. 

25 In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid 

sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid 
sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

30 100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 
in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be 
appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence 
35 D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. 
As examples of % nucleic acid sequence identity calculations, Tables 4 and 5, demonstrate how to calculate the 
% nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic acid 
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sequence designated "PRO-DNA", wherein "PRO-DNA" represents a hypothetical PRO-encoding nucleic acid 
sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule against 
which the "PRO-DNA" nucleic acid molecule of interest is being compared, and "N", "L" and "V" each represent 
different hypothetical nucleotides. 

Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are obtained 
5 as described in the immediately preceding paragraph using the ALIGN-2 computer program. However, % nucleic 
acid sequence identity values may also be obtained as described below by using the WU-BLAST-2 computer 
program (Altschul et al., Methods in Enzvmologv 266:460-480 (1996)). Most of the WU-BLAST-2 search 
parameters are set to the default values. Those not set to default values, i.e. , the adjustable parameters, are set 
with the following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11, and scoring 

10 matrix = BLOSUM62. When WU-BLAST-2 is employed, a % nucleic acid sequence identity value is determined 
by dividing (a) the number of matching identical nucleotides between the nucleic acid sequence of the PRO 
polypeptide-encoding nucleic acid molecule of interest having a sequence derived from the native sequence PRO 
polypeptide-encoding nucleic acid and the comparison nucleic acid molecule of interest (i.e. , the sequence against 
which the PRO polypeptide-encoding nucleic acid molecule of interest is being compared which may be a variant 

15 PRO polynucleotide) as determined by WU-BLAST-2 by (b) the total number of nucleotides of the PRO 
polypeptide-encoding nucleic acid molecule of interest. For example, in the statement "an isolated nucleic acid 
molecule comprising a nucleic acid sequence A which has or having at least 80% nucleic acid sequence identity 
to the nucleic acid sequence B", the nucleic acid sequence A is the comparison nucleic acid molecule of interest 
and the nucleic acid sequence B is the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid 

20 molecule of interest. 

Percent nucleic acid sequence identity may also be determined using the sequence comparison program 
NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence 
comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained from the 
National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of those 

25 search parameters are set to default values including, for example, unmask = yes, strand = all, expected 
occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 0.01, constant for multi-pass 
= 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid sequence 
identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can 

30 alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid sequence 
identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

35 where W is the number of nucleotides scored as identical matches by the sequence alignment program NCBI- 
BLAST2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will 
be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence 
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D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. 

In other embodiments, PR0842 variant polynucleotides are nucleic acid molecules that encode an active 
PR0842 polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash 
conditions, to nucleotide sequences encoding a full-length PR0842 polypeptide as disclosed herein. PR0842 
variant polypeptides may be those that are encoded by a PR0842 variant polynucleotide. 
5 "Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that has 

been identified and separated and/or recovered from a component of its natural environment. Contaminant 
components of its natural environment are materials that would typically interfere with diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous 
solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least 15 

10 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, or (2) to homogeneity 
by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, preferably, silver stain. 
Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one component of the 
PR0842 polypeptide natural environment will not be present. Ordinarily, however, isolated polypeptide will be 
prepared by at least one purification step. 

15 An "isolated" PR0842 polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is 

a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with 
which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An isolated 
polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature. 
Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific polypeptide- 

20 encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding nucleic 
acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily express the 
polypeptide where, for example, the nucleic acid molecule is in a chromosomal location different from that of 
natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked 
25 coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for 
example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid 
sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide 
30 if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is 
operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is 
operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" 
means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and 
in reading phase. However, enhancers do not have to be contiguous. Linking is accomplished by ligation at 
35 convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used 
in accordance with conventional practice. 

The term "antibody" is used in the broadest sense and specifically covers, for example, single anti- 
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PR0842 monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PR0842 antibody 
compositions with polyepitopic specificity, single chain anti-PR0842 antibodies, and fragments of anti-PR0842 
antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from a 
population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are 
identical except for possible naturally-occurring mutations that may be present in minor amounts. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. 
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary 
strands are present in an environment below their melting temperature. The higher the degree of desired 
homology between the probe and hybridizable sequence, the higher the relative temperature which can be used. 
As a result, it follows that higher relative temperatures would tend to make the reaction conditions more stringent, 
while lower temperatures less so. For additional details and explanation of stringency of hybridization reactions, 
see Ausubel et al., Current Protocols in Molecular Biology , Wiley Interscience Publishers, (1995). 

"Stringent conditions" or "high stringency conditions" , as defined herein, may be identified by those that: 
(1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 
M sodium citrate/0. 1 % sodium dodecyl sulfate at 50°C; (2) employ during hybridization a denaturing agent, such 
as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% 
polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium 
citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 
/Ag/ml), 0. 1 % SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium chloride/sodium 
citrate) and 50% formamide at 55 °C, followed by a high-stringency wash consisting of 0.1 x SSC containing 
EDTA at 55°C. 

"Moderately stringent conditions " may be identified as described by Sambrook et al. , Molecular Cloning: 
A Laboratory Manual . New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and 
hybridization conditions {e.g., temperature, ionic strength and %SDS) less stringent that those described above. 
An example of moderately stringent conditions is overnight incubation at 37° C in a solution comprising: 20% 
formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5 x 
Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, followed by 
washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust the temperature, 
ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PR0842 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against 
which an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide 
to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does not substantially 
cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid residues and 
usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino acid residues). 
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As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the binding 
specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin constant 
domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the desired binding 
specificity which is other than the antigen recognition and binding site of an antibody (i.e. , is "heterologous"), and 
an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin molecule typically is a 
5 contiguous amino acid sequence comprising at least the binding site of a receptor or a ligand. The immunoglobulin 
constant domain sequence in the immunoadhesin may be obtained from any immunoglobulin, such as IgG-1 , IgG- 
2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD or IgM. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 
blocks, inhibits, or neutralizes a biological activity of a native PR0842 polypeptide disclosed herein. In a similar 
10 manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological 
activity of a native PR0842 polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically 
include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of native 
PR0842 polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for identifying 
agonists or antagonists of a PR0842 polypeptide may comprise contacting a PR0842 polypeptide with a candidate 
15 agonist or antagonist molecule and measuring a detectable change in one or more biological activities normally 
associated with the PR0842 polypeptide. 

"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, wherein the 
object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of 
treatment include those already with the disorder as well as those prone to have the disorder or those in whom the 
20 disorder is to be prevented. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to an 
acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. "Intermittent" 
administration is treatment that is not consecutively done without interruption, but rather is cyclic in nature. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, 
25 domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, 
rabbits, etc. Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which 
30 are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often the 
physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically acceptable 
carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid; 
low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, gelatin, or 
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, 
35 asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, 
mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming 
counterions such as sodium; and/or nonionic surfactants such as TWEEN™, polyethylene glycol (PEG), and 
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PLURONICS™. 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or variable 
region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab') 2 , and Fv fragments; 
diabodies; linear antibodies (Zapata et al., Protein Eng. . 8(10) : 1057-1062 [1995]); single-chain antibody 
molecules; and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" fragments, 
each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the ability to 
crystallize readily. Pepsin treatment yields an F(ab*) 2 fragment that has two antigen-combining sites and is still 
capable of cross-linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding site. 
This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent 
association. It is in this configuration that the three CDRs of each variable domain interact to define an antigen- 
binding site on the surface of the V H -V L dimer. Collectively, the six CDRs confer antigen-binding specificity to 
the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific for 
an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain (CHI) 
of the heavy chain. Fab fragments differ from Fab' fragments by the addition of a few residues at the carboxy 
terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge region. Fab'- 
SH is the designation herein for Fab* in which the cysteine residue(s) of the constant domains bear a free thiol 
group. F(ab') 2 antibody fragments originally were produced as pairs of Fab' fragments which have hinge cysteines 
between them. Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 
of two clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant 
domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins 
can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and 
IgM, and several of these may be further divided into subclasses (isotypes), e.g. , IgGl, IgG2, IgG3, IgG4, IgA, 
and IgA2. 

"Single-chain Fv" or "sFv" antibody fragments comprise the V H and V L domains of antibody, wherein 
these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a 
polypeptide linker between the V H and V L domains which enables the sFv to form the desired structure for antigen 
binding. For a review of sFv, see Pluckthun in The Pharmacology of Monoclonal Antibodies , vol. 113, 
Rosenburg and Moore eds., Springer- Verlag, New York, pp. 269-315 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which fragments 
comprise a heavy-chain variable domain (V H ) connected to a light-chain variable domain (VJ in the same 
polypeptide chain (V h -Vl). By using a linker that is too short to allow pairing between the two domains on the 
same chain, the domains are forced to pair with the complementary domains of another chain and create two 
antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/11161; and 
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Hollinger et al., Proc. Natl. Acad. Sci. USA . 90:6444-6448 (1993). 

An "isolated" antibody is one which has been identified and separated and/or recovered from a component 
of its natural environment. Contaminant components of its natural environment are materials which would 
interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and other 
proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) to greater 
5 than 95% by weight of antibody as determined by the Lowry method, and most preferably more than 99% by 
weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by 
use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or nonreducing conditions 
using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody in situ within 
recombinant cells since at least one component of the antibody's natural environment will not be present. 
10 Ordinarily, however, isolated antibody will be prepared by at least one purification step. 

An antibody that "specifically binds to" or is "specific for" a particular polypeptide or an epitope on a 
particular polypeptide is one that binds to that particular polypeptide or epitope on a particular polypeptide without 
substantially binding to any other polypeptide or polypeptide epitope. 

The word "label" when used herein refers to a detectable compound or composition which is conjugated 
15 directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be detectable by itself 
{e.g. radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical 
alteration of a substrate compound or composition which is detectable. 

By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can 
adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g., 
20 controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and 
silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay 
plate; in others it is a purification column (e.g., an affinity chromatography column). This term also includes a 
discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
25 which is useful for delivery of a drug (such as a PR0842 polypeptide or antibody thereto) to a mammal. The 
components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement of 
biological membranes. 

A "small molecule" is defined herein to have a molecular weight below about 500 Dal tons. 
The term "modulate" means to affect (e.g. , either upregulate, downregulate or otherwise control) the 
30 level of a signaling pathway. Cellular processes under the control of signal transduction include, but are not 
limited to, transcription of specific genes, normal cellular functions, such as metabolism, proliferation, 
differentiation, adhesion, apoptosis and survival, as well as abnormal processes, such as transformation, blocking 
of differentiation and metastasis. 

"Active" or "activity" for the purposes herein refers to form(s) of a PR0842 polypeptide which retain 
35 a biological and/or an immunological activity of native or naturally-occurring PR0842 polypeptides, wherein 
"biological" activity refers to a biological function (either inhibitory or stimulatory) caused by a native or 
naturally-occurring PR0842 polypeptide other than the ability to induce the production of an antibody against an 



antigenic epitope possessed by a native or naturally-occurring PR0842 polypeptide and an " immunological " 
activity refers to the ability to induce the production of an antibody against an antigenic epitope possessed by a 
native or naturally-occurring PR0842 polypeptide. 

An "immunological" activity refers only to the ability to induce the production of an antibody against an 
antigenic epitope possessed by a native or naturally-occurring PR0842 polypeptide. 
5 The term "immune related disease" means a disease in which a component of the immune system of a 

mammal causes, mediates or otherwise contributes to a morbidity in the mammal. Also included are diseases in 
which stimulation or intervention of the immune response has an ameliorative effect on progression of the disease. 
Included within this term are immune-mediated inflammatory diseases, non-immune-mediated inflammatory 
diseases, infectious diseases, immunodeficiency diseases, neoplasia, etc. 

10 The term "T cell mediated disease" means a disease in which T cells directly or indirectly mediate or 

otherwise contribute to a morbidity in a mammal. The T cell mediated disease may be associated with cell 
mediated effects, lymphokine mediated effects, etc., and even effects associated with B cells if the B cells are 
stimulated, for example, by the lymphokines secreted by T cells. 

Examples of immune-related and inflammatory diseases, some of which are immune or T cell mediated, 

1 5 which can be treated according to the invention include systemic lupus erythematosis , rheumatoid arthritis , juvenile 
chronic arthritis, spondyloarthropathies, systemic sclerosis (scleroderma), idiopathic inflammatory myopathies 
(dermatomyositis, polymyositis), Sjogren's syndrome, systemic vasculitis, sarcoidosis, autoimmune hemolytic 
anemia (immune pancytopenia, paroxysmal nocturnal hemoglobinuria), autoimmune thrombocytopenia (idiopathic 
thrombocytopenic purpura, immune-mediated thrombocytopenia), thyroiditis (Grave's disease, Hashimoto's 

20 thyroiditis, juvenile lymphocytic thyroiditis, atrophic thyroiditis), diabetes mellitus, immune-mediated renal disease 
(glomerulonephritis, tubulointerstitial nephritis), demyelinating diseases of the central and peripheral nervous 
systems such as multiple sclerosis, idiopathic demyelinating polyneuropathy or Guillain-Barre syndrome, and 
chronic inflammatory demyelinating polyneuropathy, hepatobiliary diseases such as infectious hepatitis (hepatitis 
A, B, C, D, E and other non-hepatotropic viruses), autoimmune chronic active hepatitis, primary biliary cirrhosis, 

25 granulomatous hepatitis, and sclerosing cholangitis, inflammatory bowel disease (ulcerative colitis: Crohn's 
disease), gluten-sensitive enteropathy, and Whipple's disease, autoimmune or immune-mediated skin diseases 
including bullous skin diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as 
asthma, allergic rhinitis, atopic dermatitis, food hypersensitivity and urticaria, immunologic diseases of the 
ovaries, immunologic diseases of the lung such as eosinophilic pneumonia, idiopathic pulmonary fibrosis and 

30 hypersensitivity pneumonitis, transplantation associated diseases including graft rejection and graft 
-versus-host-disease. Infectious diseases including viral diseases such as AIDS (HIV infection), hepatitis A, B, 
C, D, and E, herpes, etc., bacterial infections, fungal infections, protozoal infections and parasitic infections. 

The term "effective amount" is a concentration or amount of a PR0842 polypeptide and/or 
agonist/antagonist which results in achieving a particular stated purpose. An "effective amount" of a PR0842 

35 polypeptide or agonist or antagonist thereof may be determined empirically. Furthermore, a "therapeutically 
effective amount" is a concentration or amount of a PR0842 polypeptide and/or agonist/antagonist which is 
effective for achieving a stated therapeutic effect. This amount may also be determined empirically. 




The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function of 
cells and/or causes destruction of cells. The term is intended to include radioactive isotopes (e.g., I 131 , I 125 , Y 90 
and Re 186 ), chemotherapeutic agents, and toxins such as enzymatically active toxins of bacterial, fungal, plant or 
animal origin, or fragments thereof. 

A "chemotherapeutic agent" is a chemical compound useful in the treatment of cancer. Examples of 
5 chemotherapeutic agents include adriamycin, doxorubicin, epirubicin, 5-fluorouracil, cytosine arabinoside 
("Ara-C"), cyclophosphamide, thiotepa, busulfan, cytoxin, taxoids, e.g., paclitaxel (Taxol, Bristol-Myers Squibb 
Oncology, Princeton, NJ), and doxetaxel (Taxotere, Rhone-Poulenc Rorer, Antony, France), toxotere, 
methotrexate, cisplatin, melphalan, vinblastine, bleomycin, etoposide, ifosfamide, mitomycin C, mitoxantrone, 
vincristine, vinorelbine, carboplatin, teniposide, daunomycin, carminomycin, aminopterin, dactinomycin, 
10 mitomycins, esperamicins (see U.S. Pat. No. 4,675,187), melphalan and other related nitrogen mustards. Also 
included in this definition are hormonal agents that act to regulate or inhibit hormone action on tumors such as 
tamoxifen and onapristone. 

A "growth inhibitory agent" when used herein refers to a compound or composition which inhibits growth 
of a cell, especially cancer cell overexpressing any of the genes identified herein, either in vitro or in vivo. Thus, 

15 the growth inhibitory agent is one which significantly reduces the percentage of cells overexpressing such genes 
in S phase. Examples of growth inhibitory agents include agents that block cell cycle progression (at a place other 
than S phase), such as agents that induce Gl arrest and M-phase arrest. Classical M-phase blockers include the 
vincas (vincristine and vinblastine), taxol, and topo II inhibitors such as doxorubicin, epirubicin, daunorubicin, 
etoposide, and bleomycin. Those agents that arrest Gl also spill over into S-phase arrest, for example, DNA 

20 alkylating agents such as tamoxifen, prednisone, dacarbazine, mechlorethamine, cisplatin, methotrexate, 
5-fluorouracil, and ara-C. Further information can be found in The Molecular Basis of Cancer . Mendelsohn and 
Israel, eds., Chapter 1, entitled "Cell cycle regulation, oncogens, and antineoplastic drugs" by Murakami etal., 
(WB Saunders: Philadelphia, 1995), especially p. 13. 

The term "cytokine" is a generic term for proteins released by one cell population which act on another 

25 cell as intercellular mediators. Examples of such cytokines are lymphokines, monokines, and traditional 
polypeptide hormones. Included among the cytokines are growth hormone such as human growth hormone, 
N-methionyl human growth hormone, and bovine growth hormone; parathyroid hormone; thyroxine; insulin; 
proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), thyroid 
stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; fibroblast growth factor; 

30 prolactin; placental lactogen; tumor necrosis factor-a and -p; mullerian-inhibiting substance; mouse 
gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; integrin; thrombopoietin 
(TPO); nerve growth factors such as NGF-P; platelet-growth factor; transforming growth factors (TGFs) such as 
TGF-a and TGF-P; insulin-like growth factor-I and -II; erythropoietin (EPO); osteoinductive factors; interferons 
such as interferon-a, -P, and -y; colony stimulating factors (CSFs) such as macrophage-CSF (M-CSF); 

35 granulocyte-macrophage-CSF (GM-CSF); and granulocyte-CSF (G-CSF); interleukins (ILs) such as IL- 1 , IL- la, 
IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, or IL-17; a tumor necrosis factor such as 
TNF-a or TNF-p; and other polypeptide factors including leukemia inhibitory factor (LIF) and kit ligand (KL). 
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As used herein, the term cytokine includes proteins from natural sources or from recombinant cell culture and 
biologically active equivalents of the native sequence cytokines. 

The term "chemokine" refers to a class of molecules belongiong to the family of proinflammatory 
cytokines of low molecular mass (8-11 kDa) characterized by a structurally conserved motif and their ability to 
mediate leukocyte chemotaxis, thus playing a role in leukocyte trafficking as well as playing a role in leukocyte 
5 regulation. 
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Table 1 

/* 
* 

* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is _M; stop-stop = 0; J (joker) match = 0 
*/ 

^define _M -8 /* value of a match with a stop */ 



int _day[26][26] = { 

/* ABCDEFGHIJKLMNO 

/* A */ { 2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0, 

/* B */ { 0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2, 

/* C */ {-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4 

/* D */ { 0, 3,-5, 4, 3,-6, 1, 1,-2, 0, 0,-4,-3, 2, 

/* E */ { 0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1, 

/* F */ {-4,-5,-4,-6,-5, 9,-5,-2, 1, 0,-5, 2, 0,-4* 

/* G */ { 1, 0,-3, 1, 0,-5, 5,-2,-3, 0,-2,-4,-3, 0, 

/* H */ {-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2, 

/* I */ {-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2, 

/* J */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,J 

/* K */ {-1, 0,-5, 0, 0,-5,-2, 0,-2, 0, 5,-3, 0, 1, 

/* L */ {-2,-3,-6,-4,-3, 2,-4,-2, 2, 0,-3, 6, 4,-3, 

/* M */ {-1,-2,-5,-3,-2, 0,-3,-2, 2, 0, 0, 4, 6,-2, 

/* N */ { 0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1,-3,-2, 2,_ 

/* O */ {_M,_M,_M,_M,_M,_M,_M,_M,_M,^M 

/* P */ { 1,-1,-3,-1,-1,-5,-1, 0,-2, 0,-1,-3,-2,-1, 

/* Q */ { 0, 1,-5, 2, 2,-5,-1, 3,-2, 0, 1,-2,-1, 1, 

/* R */ {-2, 0,-4,-1,-1,-4,-3, 2,-2, 0, 3,-3, 0, 0, 

/* S */ { 1, 0, 0, 0, 0,-3, 1,-1,-1, 0, 0,-3,-2, 1, 

/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_ 

/* U */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 

/* V */ { 0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2,-2, 

/* W */ {-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3,-2,-4,-4," 

/* X */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_ 

/* Y */ {-3,-3, 0,-4,-4, 7,-5, 0,-1, 0,-4,-1,-2,-2, 

/* Z */ { 0, 1,-5, 2, 3,-5, 0, 2,-2, 0, 0,-2,-1, 1, 

}; 



PQRSTUVWXYZ*/ 
M, 1, 0,-2, 1,1,0, 0,-6, 0,-3, 0}, 
~M,-1, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1}, 
,_M,-3,-5,-4, 0,-2, 0,-2,-8, 0, 0,-5}, 
M,-l, 2,-1, 0, 0, 0,-2,-7, 0,-4, 2}, 
~M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 3}, 
lM,-5,-5,-4,-3,-3, 0,-1, 0, 0, 7,-5}, 
_M,-l,-l,-3, 1,0, 0,-1,-7, 0,-5,0}, 
M, 0, 3,2,-1,-1,0,-2,-3, 0, 0, 2}, 
_M,-2,-2,-2,-l, 0, 0, 4,-5, 0,-1,-2}, 
M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
M,-l, 1, 3, 0, 0, 0,-2,-3, 0,-4, 0}, 
M,-3, -2,-3,-3,-1, 0, 2,-2, 0,-1,-2}, 
M,-2,-l, 0,-2,-1, 0, 2,-4, 0,-2,-1}, 
M,-l, 1, 0, 1, 0, 0,-2,-4, 0,-2, 1}, 
M,_M,_M,_M, 0,_M,_M,_M,_M,_M,_M,_M,^M,_M,_M,_M}, 



M, 6, 0, 0, 1, 0, 0,-1,-6, 0,-5, 0}, 
M, 0, 4, 1,-1,-1,0,-2,-5,0,-4, 3}, 
_M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0}, 
M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0}, 
M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0}, 
M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
_M,-l,-2,-2,-l, 0, 0, 4,-6, 0,-2,-2}, 
M,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6}, 
M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
_M,-5,-4,-4,-3,-3, 0,-2, 0, 0,10,-4}, 
M, 0, 3, 0, 0, 0, 0,-2,-6, 0,-4, 4} 
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/* 
*/ 

^include <stdio.h> 
^include <ctype.h> 



Table 1 (conV) 
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#define 


MAXJMP 


16 


/* max jumps in a diag */ 


^define 


MAXGAP 


24 


/* don't continue to penalize gaps larger than this */ 


//define 


JMPS 


1024 


/* max jmps in an path */ 


//define 


MX 


4 


/* save if there's at least MX-1 bases since last jmp */ 


#define 


DMAT 


3 


/* value of matching bases */ 


//define 


DMIS 


0 


/* penalty for mismatched bases */ 


//define 


DINSO 


8 


/* penalty for a gap */ 


//define 


DINS1 


1 


/* penalty per base */ 


//define 


PINSO 


8 


/* penalty for a gap */ 


//define 


PINS1 


4 


/* penalty per residue */ 



20 



25 



30 



35 



40 



45 



50 
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struct jmp { 

short 

unsigned short 

}; 



n[MAXJMP]; 
x[MAXJMP]; 



/* size of jmp (neg for dely) */ 
/* base no. of jmp in seq x */ 
/* limits seq to 2* 16 -1 */ 



struct diag { 








int 


score; 


/* score at last jmp */ 




long 


offset; 


/* offset of prev block */ 




short 


ijmp; 


/* current jmp index */ 


>; 


struct jmp jp; 


/* list of jmps */ 


struct path { 








int 


spc; /* 


number of leading spaces */ 




short 


n[JMPS];/* size of jmp (gap) */ 


}; 


int 


x[JMPS];/* loc of jmp (last elem before gap) */ 


char 




*ofile; 


/* output file name */ 


char 




*namex[2]; 


/* seq names: getseqs() */ 


char 




*prog; 


/* prog name for err msgs */ 


char 




*seqx[2]; 


/* seqs: getseqs() */ 


int 




dmax; 


/* best diag: nw() */ 


int 




dmaxO; 


/* final diag */ 


int 




dna; 


/* set if dna: main() */ 


int 




endgaps; 


/* set if penalizing end gaps */ 


int 




gapx, gapy; 


/* total gaps in seqs */ 


int 




lenO, lenl; 


/* seq lens */ 


int 




ngapx, ngapy; 


/* total size of gaps */ 


int 




smax; 


/* max score: nw() */ 


int 




*xbm; 


/* bitmap for matching */ 


long 




offset; 


/* current offset in jmp file */ 


struct 


diag 


*dx; 


/* holds diagonals */ 


struct 


path 


pp[2]; 


/* holds path for seqs */ 


char 




*calloc(), *malloc(), * 


indexQ, *strcpy(); 


char 




*getseq(), *g_calloc(); 
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Table 1 (conV) 

/* Needleman-Wunsch alignment program 
* 

* usage: progs filel file2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with ' ;','>' or ' < ' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align. out" 
* 

* The program may create a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
*/ 

^include "nw.h" 
^include "day.h" 

static _dbval[26] = { 

1 , 14,2, 13,0,0,4, 1 1 ,0,0, 12,0,3, 15,0,0,0,5,6,8,8,7,9,0, 10,0 

}; 

static _pbval[26] = { 

1, 2|(1<<( , D , - , A , ))|(1< <( , N , - , A , », 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1< < 10, 1< < 1 1, 1< < 12, 1< < 13, 1< < 14, 
1<<15, 1<<16, 1<<17, 1<<18, 1< < 19, 1< <20, 1< <21, 1< <22, 
1 < <23, 1 < <24, 1 < <25 1(1 < <( , E , - , A'))|(1 < <('Q'-'A')) 

}; 

main(ac, av) 

int ac; 
char *av[]; 

{ 

prog = av[0]; 
if (ac != 3) { 

fprintf(stderr, "usage: %s filel flle2\n", prog); 

fprintf(stderr, "where filel and file2 are two dna or two protein sequences. \n"); 
fprintf(stderr,"The sequences can be in upper- or lower-case\n"); 
fprintf(stderr,"Any lines beginning with ';' or ' < ' are ignored\n"); 
t>rintf(stderr, "Output is in the file \"align.out\"\n"); 
exit(l); 

} 

namex[0] = av[l]; 

namex[l] = av[2]; 

seqx[0] = getseq(namex[0], &len0); 

seqx[l] = getseq(namex[l], &lenl); 

xbm = (dna)? _dbval : jpbval; 

endgaps =0; /* 1 to penalize endgaps */ 

ofile = "align.out"; /* output file */ 

nw(); /* fill in the matrix, get the possible jmps */ 

readjmps(); /* get the actual jmps */ 

print(); /* print stats, alignment */ 

cleanup(O); /* unlink any tmp files */ 
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Table 1 (conf) 

/* do the alignment, return best score: main() 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
*/ 

nw() 
{ 

char *px, *py; /* seqs and ptrs */ 

int *ndely, *dely; /* keep track of dely */ 

int ndelx, delx; /* keep track of delx */ 

int *tmp; /* for swapping rowO, rowl */ 

int mis; /* score for each type */ 

int insO, insl; /* insertion penalties */ 

register id; /* diagonal index */ 

register ij; /*jmp index*/ 

register *col0, *coll; /* score for curr, last row */ 

register xx, yy; /* index into seqs */ 



nw 



dx = (struct diag *)g_calloc("to get diags", IenO + lenl + 1, sizeof(struct diag)); 

ndely = (int *)g_calloc( M to get ndely", lenl + 1, sizeof(int)); 
dely = (int *)g_calloc("to get dely", lenl + 1, sizeof(int)); 
colO = (int *)g_calloc("to get colO", lenl + 1, sizeof(int)); 
coll — (int *)g_calloc("to get coll", lenl + 1, sizeof(int)); 
insO = (dna)? DINSO : PINSO; 
insl = (dna)? DINS1 : PINS1; 

smax = -10000; 
if (endgaps) { 

for (col0[0] = dely[0] = -insO, yy = 1; yy < = lenl; yy+ +) { 
col0[yy] = delyfyy] = col0[yy-l] - insl; 
ndely[yy] = yy; 

} 

col0[0] = 0; /* Waterman Bull Math Biol 84 */ 



} 

else 



for (yy = 1; yy < = lenl; yy+ +) 
dely[yy] = -insO; 



/* fill in match matrix 
*/ 

for (px = seqx[0], xx = 1; xx < = lenO; px+ + , xx+ +) { 
/* initialize first entry in col 
*/ 

if (endgaps) { 

if (xx == 1) 

coll[0] 

else 

coll[0] 
ndelx = xx; 



delx = -(insO + insl); 
delx = col0[0] - insl; 



} 

else { 



coll[0] = 0; 
delx = -insO; 
ndelx = 0; 
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Table 1 fcont^ 

...nw 

for (py = seqx[l], yy = 1; yy < = lenl; py++, yy++) { 
mis = colO[yy-l]; 
if (dna) 

5 mis + = (xbm[*px-'A']&xbm[*py-'A*])? DMAT : DMIS; 

else 

mis += __day[*px-'A , ][*py-'A']; 

/* update penalty for del in x seq; 
10 * favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps | | ndely[yy] < MAXGAP) { 

if (col0[yy] - insO > = dely[yy]) { 
15 dely[yy] = col0[yy] - (insO + insl); 

ndely[yy] = 1; 

} else { 

dely[yy] -= insl; 
ndely[yy] + + ; 

20 } 

} else { 

if (coI0[yy] - (insO+insl) > = dely[yy]) { 

dely[yy] = col0[yy] - (insO + insl); 
ndely[yy] = 1; 

25 } else 

ndely[yy] + +; 

} 

/* update penalty for del in y seq; 
30 * favor new del over ongong del 

*/ 

if (endgaps | | ndelx < MAXGAP) { 

if (coll[yy-l] - insO > = delx) { 

delx = coll[yy-l] - (insO + insl); 
35 ndelx = 1; 

} else { 

delx -= insl; 
ndelx + + ; 

40 }else{ } 

if (coIl[yy-l] - (insO+insl) > = delx) { 
delx = coll[yy-l] - (insO + insl); 
ndelx = 1; 

} else 

45 ndelx++; 

} 

/* pick the maximum score; we're favoring 

* mis over any del and delx over dely 
50 */ 



55 
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Table 1 (conf) 

...nw 

id = xx - yy + lenl - 1; 
if (mis > = delx && mis > = dely[yy]) 
collfyy] = mis; 

5 else if (delx > = dely[yy]) { 

coll[yy] = delx; 
ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna | | (ndelx > = MAXJMP 
&& xx > dx[id].jp.x[ij]+MX) | | mis > dx[id]. score +DINSO)) { 
10 dx[id].ijmp++; 

if (+ +ij > = MAXJMP) { 
writejmps(id); 
ij — dx[id].ijmp = 0; 
dx[id]. offset = offset; 

15 offset += sizeof(struct jmp) + sizeof (offset); 

dx[id] jp.n[ij] = ndelx; 
dx[id] jp.x[ij] = xx; 

20 dx[id]. score = delx; 

} 

else { 

coll[yy] = dely[yy]; 
ij = dx[id].ijmp; 

25 if (dx[id].jp.n[0] && (!dna | | (ndely[yy] > = MAXJMP 

&& xx > dx[id].jp.x[ij]+MX) | | mis > dx[id]. score +DINS0)) { 
dx[id].ijmp+ + ; 
if (++ij > = MAXJMP) { 
writejmps(id); 

30 ij = dx[id].ijmp = 0; 

dx[id]. offset = offset; 

offset + = sizeof (struct jmp) + sizeof (off set); 

} } 

35 dx[id].jp.n[ij] = -ndely[yy]; 

dx[id].jp.x[ij] = xx; 
dx[id]. score = dely[yy]; 

} 

if (xx = = lenO && yy < lenl) { 
40 /* last col 

*/ 

if (endgaps) 

coll[yy] -= ins0+insl*(lenl-yy); 
if (coll[yy] > smax) { 
45 smax = coll[yy]; 

dmax = id; 

} 

} } 

50 if (endgaps && xx < lenO) 

coll[yy-l] -= ins0+insl*(len0-xx); 
if (coll[yy-l] > smax) { 

smax = coll[yy-l]; 
dmax = id; 

55 } 

tmp = colO; colO = coll; coll = tmp; 

} 

(void) free((char *)ndely); 
(void) free((char *)dely); 



60 (void) free((char *)colO) 

(void) free((char *)coll): 



} 
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Table 1 (conV) 

/* 
* 

* print() — only routine visible outside this module 
* 

* static: 

* getmat() — trace back best path, count matches: print() 

* pr_align() — print alignment of described in array p[]: print() 

* dumpblock() — dump a block of lines with numbers, stars: pr_align() 

* nums() ~ put out a number line: dumpblock() 

* putline() - put out a line (name, [num], seq, [num]): dumpblock() 

* starsQ - -put a line of stars: dumpblock() 

* stripnameO — strip any path and prefix from a seqname 
*/ 

^include "nw.h" 



20 



^define SPC 3 
#define P JJNE 256 
#define P_SPC 3 

extern _day[26][26]; 
int olen; 
FILE *fx; 



/* maximum output line */ 

/* space between name or num and seq */ 



/* set output line length */ 
/* output file */ 



25 



30 



35 



40 



45 
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print() 
{ 



print 



lx, ly, firstgap, lastgap; /* overlap */ 
fopen(ofile, "w")) = = 0) { 

fprintf(stderr,"%s: can't write %s\n", prog, ofile); 
cleanup(l); 



int 

if ((fx 
} 

rprintf(fx, w < first sequence: %s (length = %d)\n", namex[0], lenO); 
fj>rintf(fx, "< second sequence: %s (length = %d)\n M , namex[l], lenl);. 
olen = 60; 
lx = lenO; 
ly = lenl; 

firstgap = lastgap = 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 
pp[0]..spc = firstgap = lenl - dmax - 1; 
ly -= pp[0].spc; 

} 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l].spc = firstgap = dmax - (lenl - 1); 
lx -= pp[l].spc; 

} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1; 
lx -= lastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap = dmaxO - (lenO - 1); 
ly -= lastgap; 

> 

getmat(lx, ly, firstgap, lastgap); 
pr_align(); 
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* trace back the best path, count matches 
*/ 

static 

getmat(lx, ly, firstgap, lastgap) 
int lx, Iy; 
int 



{ 



firstgap, lastgap; 



/* "core" (minus endgaps) */ 
/* leading trailing overlap */ 



getmat 



int 

char 

double 

register 

register char 



nm, iO, il, sizO, sizl; 

outx[32]; 

pet; 

nO, nl; 

*p0, *pl; 



/* get total matches, score 
*/ 

iO = il = sizO = sizl = 0; 
pO = seqx[0] + pp[l].spc; 
pi = seqx[l] + pp[0].spc; 
nO = pp[l].spc + 1; 
nl = pp[0].spc + 1; 

nm = 0; 

while ( *p0&& *pl ) { 
if (sizO) { 

pl + +; 
nl + + ; 
sizO--; 

} 

else if (sizl) { 

p0+ + ; 



n0+- 
sizl-- 



} 

else { 



if (xbm[*pO-'A , 3&xbm[*pl-'A']) 

nm+ + ; 
if (n0+ + = = pp[0].x[iO]) 

sizO = pp[0].n[iO++]; 
if (nl + + ==pp[l].x[il]) 

sizl = pp[l].n[il + +]; 

p0+ + ; 

pl + + ; 

} 

} 

/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 
*/ 

if (endgaps) 

lx = (lenO < lenl)? lenO : lenl; 

else 

lx = (lx < ly)? lx : ly; 
pet = 100.*(double)nm/(double)lx; 

fprmtf(fx, "W); 

fprintf(fx, " < %d match%s in an overlap of %d: %.2f percent similarity^", 
nm, (nm == 1)? "" : "es", lx, pet); 



60 



36 



Table 1 (eonf) 



fprintf(fx, " <gaps in first sequence: %d", gapx); 
if (gapx) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapx, (dna)? "base":"residue", (ngapx == 1)? n ":"s"); 

fprintf(fx,"%s\ outx); 



.getmat 



} 



fprintf(fx, ", gaps in second sequence: %d", gapy); 
if (gapy) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapy, (dna)? "base": "residue", (ngapy == 1)? "":"s"); 
rprintf(fx, M %s\ outx); 



} 

if (dna) 



else 



fprintf(fx, 

"\n< score: %d (match = %d, mismatch = %d, gap penalty = %d + %d per base)\n\ 
smax, DM AT, DMIS, DINSO, DINS1); 



fjprintf(fx, 

"\n< score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n", 
smax, PINSO, PINS1); 
if (endgaps) 

fprintf(fx, 

"< endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap == 1)? "" : "s", 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? "" : "s"); 



else 



fprintf(fx, "< endgaps not penalized\n"); 



static nm; 

static lmax; 

static ij[2]; 

static nc[2]; 

static ni[2]; 

static , siz[2]; 

static char *ps[2]; 

static char *po[2]; 

static char out[2][P_LINE]; /* output line */ 

static char star[P_LINE]; /* set by stars() */ 



/* matches in core — for checking */ 
/* lengths of stripped file names */ 
/* jmp index for a path */ 
/* number at start of current line */ 
/* current elem number — for gapping */ 

/* ptr to current element */ 
/* ptr to next output char slot */ 



/* 

* print alignment of described in struct path pp[] 
*/ 

static 

pr_align() 
{ 

int nn; /* char count */ 

int more; 
register i; 



pralign 



for (i = 0, lmax = 0; i < 2; i+ +) { 
nn = stripname(namex[i]); 
if (nn > lmax) 

lmax = nn; 



nc[i] = 1; 
ni[i] = 1; 
siz[i] = ij[i] = 0; 
ps[i] = seqx[i]; 
po[i] = out[i]; 
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Table 1 (conf) 

for (nn = nm = 0, more = 1; more; ) { ...prallgn 

for (i = more = 0; i < 2; i+ +) { 

/* 

5 * do we have more of this sequence? 

*/ 

if(!*ps[i]) 

continue; 

10 more++; 

if (pp[i].spc) { /* leading space */ 
*po[i] + + = *'; 
pp[i].spc--; 

15 } 

else if (siz[i]) { /* in a gap */ 
*po[i] + + = 
siz[i]--; 

20 else { /* we're putting a seq element 

*/ 

*po[i] = *ps[i]; 
if (islower(*ps[i])) 

*ps[i] = toupper(*ps[i]); 

25 po[i] + + ; 

ps[i]++; 

/* 

* are we at next gap for this seq? 

30 */ 

if(ni[ij ==pp[i].x[ij[i]]){ 
/* 

* we need to merge all gaps 

* at this location 

35 */ 

siz[i] = pp[i].n[ij[i] + +]; 
while (ni[i] == pp[i].x[ij[i]]) 

siz[i] += pp[i].n[ij[i] + +]; 

> 

40 ni[i]++; 

} 

} 

if ( + +nn = = olen 1 1 !more && nn) { 
dumpblockO; 

45 for(i = 0; i < 2; i++) 



50 



} 



po[i] = out[i]; 

nn = 0; 



/* 
* t 



'■ dump a block of lines, including numbers, stars: pr_align() 
*/ 

55 static 

dumpblockO dumpblock 

{ 

register i; 

60 for (i = 0; i < 2; i++) 

*po[i]-- = •\0'; 
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(void) putc('\n\ fx); 

for (i = 0; i < 2; i++) { 

if (*out[i] && (*out[i] != ' ' || *(po[i]) !='*)){ 
if(i ==0) 

nums(i); 
if(i == o&& *out[l]) 
stars(); 

putline(i); 

if (i == 0&& *out[l]) 

fprintf(fx, star); 
if(i — 1) 

nums(i); 



} 



.dump block 
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* put out a number line: dumpblock() 

*/ 

static 

nums(ix) 

int ix; /* index in out[] holding seq line */ 

{ 

char nline[P_LINE]; 

register i, j; 

register char *pn, *px, *py; 

for (pn = nline, i = 0; i < lmax + PSPC; i+ + , pn++) 
*pn = ' 

for (i = nc[ix], py = out[ix]; *py; py + +, pn-f +) { 

if(* Py ==" 1 1 *py == , -') 

*pn = ' 

else { 

if (i% 10 = = 0 | | (i = = 1 && nc[ix] ! = 1)) { 
j = (i < 0)? -i : i; 
for (px = pn; j; j /= 10, px-) 
*px = j%10 + '0'; 

if (i < 0) 

*px = *-'; 



nums 



} 



} 

else 

i++; 



*pn = ' 



} 

*pn = '\0'; 
nc[ix] = i; 

for (pn = nline; *pn; pn + + ) 
(void) putc(*pn, fx); 
(void) putc( ' \n ' , f x) ; 



55 /* 

* put out a line (name, [num], seq, [num]): dumpblock() 

*/ 

static 

putline(ix) 

60 int ix; { 



putline 
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Table 1 (conf) 

...put line 



int i; 
register char *px; 



for (px = namex[ix], i = 0; *px && *px != px+ + , i+ +) 

(void) putc(*px, fx); 
for(; i < lmax + P_SPC; i++) 

(void) putcC fx); 

/* these count from 1 : 

* ni[] is current element (from 1) 

* nc[] is number at start of current line 
*/ 

for (px = out[ix]; *px; px+ + ) 

(void) putc(*px&0x7F, fx); 
(void) putc(*\n\ fx); 



/* 

* put a line of stars (seqs always in out[0], out[l]): dumpblock() 

*/ 

static 

stars() stars 



{ 



int i; 

register char *p0, *pl, cx, *px; 

if (!*out[0] | | (*out[0] = ='•&& *(po[0]) == ' ') | | 
!*out[l] | | (*out[l] ==''&& *(po[l]) = ='')) 
return; 
px = star; 

for (i = lmax+PSPC; i; i~) 
*px+ + = ' '; 

for (pO = out[0], pi = out[l]; *p0 && *pl; p0++, pl + +) { 
if (isalpha(*pO) && isalpha(*pl)) { 

if (xbm[*pO- , A , ]&xbm[*pl- , A']) { 

cx = '*'; 
nm++; 

} 

ebeif (!dna && _day[*pO-'A'][*pl- , A'] > 0) 

cx = V; 



} 



} 



cx = 
*px+ + = cx; 

} 

*px++ = '\n'; 
*px = AO'; 
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Table 1 (conV) 

I* 

* strip path or prefix from pn, return len: pr_align() 
*/ 

static 

stripname(pn) Stripname 



{ 



char *pn; /* file name (may be path) */ 
register char *px, *py; 



10 py = 0; 

for (px = pn; *px; px + +) 
if (* px == V) 

py = px + 1; 

if(py) 

15 (void) strcpy(pn, py); 

return(strlen(pn)); 
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Table 1 (eolith) 

/* 

* cleanupO — cleanup any tmp file 

* getseqO ~ read in seq, set dna, len, maxlen 

* g_calloc() — callocO with error checkin 

* readjmps() ~ get the good jmps, from tmp file if necessary 

* writejmpsO — write a filled array of jmps to a tmp file: nw() 
*/ 

^include "nw.h" 
^include <sys/file.h> 



char *jname = 7tmp/homgXXXXXX ,, ; 

FILE *fj; 

int cleanupO; 

long lseek(); 



/* tmp file for jmps */ 
/* cleanup tmp file */ 
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/* 

* remove any tmp file if we blow 

*/ 

cleanup(i) 
{ 



int 

if(fj) 



exit(i); 



(void) unlink(jname); 



cleanup 
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/* 

* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with ' ; ' , ' < ' , or * > ' 

* seq in upper or lower case 
*/ 

char * 

getseq(file, len) 
char 
int 



*file; 
*len; 



char 

register char 
int 

FILE 



/* file name */ 
/* seq len */ 

line[1024], *pseq; 
*px, *py; 
natgc, tlen; 
*fp; 



getseq 



if ((fp = fopen(file,V')) == 0) { 

fprintf(stderr,"%s: can't read %s\n\ prog, file); 
exit(l); 

} 

tlen = natgc = 0; 

while (fgets(line, 1024, fp)) { 

if (*line == ';' | | *line == '<' | | *line == *>') 

continue; 
for (px = line; *px ! = '\n'; px+ + ) 

if (isupper(*px) | | islower(*px)) 
tlen++; 

} 

if ((pseq = malloc((unsigned)(tlen+6))) == 0) { 

fprintf(stderr,"%s: malloc() failed to get %d bytes for %s\n M , prog, tlen-f-6, file); 
exit(l); 

} 

pseq[0] = pseqfl] = pseq[2] = pseq[3] = '\0'; 
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...getseq 



Table 1 (conV) 

py = pseq 4- 4; 
*len = tlen; 
rewind(fp); 

5 

while (fgets(line, 1024, fp)) { 

if (*line = = V | | *line = = ' < * | | *line = = * > ') 
continue; 

for (px = line; *px != '\n*; px++) { 
10 if (isupper(*px)) 

*py++ = *px; 

else if (islower(*px)) 

*py+ + = toupper(*px); 
if (index("ATGCU\*(py-l))) 
15 natgc + + ; 

} 

} 

*py++ = '\0'; 
*py = '\0'; 
20 (void) fclose(fp); 

dna = natgc > (tlen/3); 
return(pseq+4); 

} 

25 char * 

g_calloc(msg, nx, sz) g^calloc 
char *msg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 

30 char *px, *calloc(); 

if ((px = calloc((unsigned)nx, (unsigned) sz)) = = 0) { 
if (*msg) { 

fprintf(stderr, "%s: g_calloc() failed %s (n=%d, sz=%d)\n\ prog, msg, nx, sz); 
35 exit(l); 

} 

} 

retufn(px); 



} 

/* 

* get final jmps from dx[] or tmp file, set pp[], reset dmax: main() 
*/ 



readjmpsO readjmps 
45 { 

int fd = -1; 

int siz, iO, il; 

register i, j, xx; 

50 if(fj){ 

(void) fclose(fj); 

if ((fd = open(jname, O RDONLY, 0)) < 0) { 

Q>rintf(stderr, "%s: can't open() %s\n", prog, jname); 
cleanup(l); 

55 } 
} 

for (i = iO = il = 0, dmaxO = dmax, xx = lenO; ; i + +) {. 
while (1) { 

for (j = dx[dmax].ijmp; j > = 0 && dx[dmax].jp.x[j] > = xx; j--) 
60 ; 
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Table 1 (conf) 

...readjmps 

if (j < 0 && dxfdmax]. offset && fj) { 

(void) lseek(fd, dxfdmax]. offset, 0); 

(void) read(fd, (char *)&dx[dmax].jp, sizeof(struct jmp)); 

(void) read(fd, (char *)&dx[dmax].offset, sizeof(dx[dmax].offset)); 

dxfdmax]. ijmp = MAXJMP-1; 

} 

else 

break; 

} 

if (i > = JMPS) { 

fprintf(stderr, "%s: too many gaps in alignment\n", prog); 
cleanup(l); 

} 

if(j >=0){ 

siz = dxfdmax] .jp.n[j]; 
xx = dxfdmax] jp.x[j]; 
dmax + = siz; 

if (siz < 0) { /* gap in second seq */ 

ppfl].nfil] = -siz; 
xx + = siz; 

/* id = xx - yy 4- lenl - 1 
*/ 

ppfl].xfil] = xx - dmax + lenl - 1; 
gapy + + ; 
ngapy -= siz; 
ignore MAXGAP when doing endgaps */ 

siz = (-siz < MAXGAP | | endgaps)? -siz : MAXGAP; 
il + + ; 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].nfi0] = siz; 
ppfoj.xfiO] = xx; 
gapx+ + ; 
ngapx -f = siz; 
ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP | | endgaps)? siz : MAXGAP; 
i0+ + ; 

} 

} 

else 

break; 

} 

/* reverse the order of jmps 
*/ 

for(j = 0, iO— ; j < i0;j + + , i0~) { 

i = pp[0].n[j]; pp[0].n[j] = pp[0].n[i0]; ppf0].nfi0] = i; 
i = pp[0].x[j]; pp[0].x[j] = pp[0].x[i0]; pp[0].x[i0] = i; 

} 

for (j = 0, il — ; j < il; j + il~) { 

i = PPtH.nlj]; pp[l].n[j] = ppfl].nfil]; ppfl].n[il] = i; 
i = pp[l].x[j]; pp[l].xU] = pp[l].x[il]; pp[l].x[il] = i; 

} 

if (fd >= 0) 

(void) close(fd); 

if(fi){ 

(void) unlink(jname); 
t] =0; 
offset = 0; 
} } 
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Table 1 (conn 

/* 

* write a filled jmp struct offset of the prev one (if any): nw() 
*/ 

5 writejmps(ix) writejmps 

int ix; 

{ 

char *mktemp(); 
10 if(!fj){ 

if (mktemp(jname) < 0) { 

rprintf(stderr, "%s: can't mktemp() %s\n", prog, jname); 
cleanup(l); 

15 if ((fj = fopen(jname, "w")) == 0) { 

fprintf(stderr, "%s: can't write %s\n", prog, jname); 
exit(l); 

} } 

20 (void) fwrite((char *)&dx[ix] jp, sizeof(struct jmp), 1, fj); 

(void) fwrite((char *)&dx[ix]. offset, sizeof(dx[ix], offset), 1, fj); 
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Table 2 



PRO XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

5 % amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 

10 5 divided by 15 = 33.3% 

Table 3 

PRO XXXXXXXXXX (Length = 10 amino acids) 

Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 



15 



20 



% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined 
by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 

5 divided by 10 = 50% 

Table 4 



PRO-DNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

25 Comparison DNA NNNNNNLLLLLLLLLL (Length = 16 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN- 
30 2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

6 divided by 14 = 42.9% 
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Table 5 

PRO-DNA NNNNNNNNNNNN (Length = 12 nucleotides) 

Comparison DNA NNNNLLLVV (Length = 9 nucleotides) 

5 % nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN- 
2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

10 4 divided by 12 = 33.3% 

II. Compositions and Methods of the Invention 

A. Full-Length PRQ842 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
15 referred to in the present application as PR0842 polypeptides. In particular, cDNAs encoding various PR0842 
polypeptides have been identified and isolated, as disclosed in further detail in the Examples below. It is noted 
that proteins produced in separate expression rounds may be given different PRO numbers but the UNQ number 
is unique for any given DNA and the encoded protein, and will not be changed. However, for sake of simplicity, 
in the present specification the protein encoded by the full length native nucleic acid molecules disclosed herein 
20 as well as all further native homologues and variants included in the foregoing definition of PRO, will be referred 
to as "PRO/number", regardless of their origin or mode of preparation. 

As disclosed in the Examples below, cDNA clones have been deposited with the ATCC. The actual 
nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of the 
deposited clone using routine methods in the art. The predicted amino acid sequence can be determined from the 
25 nucleotide sequence using routine skill. For the PR0842 polypeptides and encoding nucleic acids described 
herein, Applicants have identified what is believed to be the reading frame best identifiable with the sequence 
information available at the time. 

B. PRQ842 Polypeptide Variants 

30 In addition to the full-length native sequence PR0842 polypeptides described herein, it is contemplated 

that PR0842 variants can be prepared. PR0842 variants can be prepared by introducing appropriate nucleotide 
changes into the PR0842 DNA, and/or by synthesis of the desired PR0842 polypeptide. Those skilled in the art 
will appreciate that amino acid changes may alter post-translational processes of the PR0842, such as changing 
the number or position of glycosylation sites or altering the membrane anchoring characteristics. 

35 Variations in the native full-length sequence PR0842 or in various domains of the PR0842 described 

herein, can be made, for example, using any of the techniques and guidelines for conservative and non- 
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conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, 
deletion or insertion of one or more codons encoding the PR0842 that results in a change in the amino acid 
sequence of the PR0842 as compared with the native sequence PR0842. Optionally the variation is by 
substitution of at least one amino acid with any other amino acid in one or more of the domains of the PR0842. 
Guidance in determining which amino acid residue may be inserted, substituted or deleted without adversely 
5 affecting the desired activity may be found by comparing the sequence of the PR0842 with that of homologous 
known protein molecules and minimizing the number of amino acid sequence changes made in regions of high 
homology. Amino acid substitutions can be the result of replacing one amino acid with another amino acid having 
similar structural and/or chemical properties, such as the replacement of a leucine with a serine, i.e. , conservative 
amino acid replacements. Insertions or deletions may optionally be in the range of about 1 to 5 amino acids. The 
10 variation allowed may be determined by systematically making insertions, deletions or substitutions of amino acids 
in the sequence and testing the resulting variants for activity exhibited by the full-length or mature native 
sequence. 

PR0842 polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus 
or C-terminus, or may lack internal residues, for example, when compared with a full length native protein. 
15 Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PR0842 
polypeptide. 

PR0842 fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PR0842 fragments by 
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by 

20 particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the desired 
fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding a desired 
polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired termini of 
the DNA fragment are employed at the 5' and 3' primers in the PCR. Preferably, PR0842 polypeptide fragments 
share at least one biological and/or immunological activity with the native PR0842 polypeptide disclosed herein. 

25 In particular embodiments, conservative substitutions of interest are shown in Table 6 under the heading 

of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial 
changes, denominated exemplary substitutions in Table 6, or as further described below in reference to amino acid 
classes, are introduced and the products screened. 
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Table 6 



Original Exemplary Preferred 

Residue Substitutions Substitutions 

5 Ala (A) val; leu; ile val 

Arg (R) lys; gin; asn lys 

Asn (N) gin; his; lys; arg gin 

Asp (D) glu glu 

Cys (C) ser ser 

10 Gln(Q) asn asn 

Glu (E) asp asp 

Gly (G) Pro; ala ala 

His (H) asn; gin; lys; arg arg 

Ile (I) leu; val; met; ala; phe; 

15 norleucine leu 

Leu (L) norleucine; ile; val; 

met; ala; phe ile 

Lys (K) arg; gin; asn arg 

Met (M) leu; phe; ile leu 

20 Phe (F) leu; val; ile; ala; tyr leu 

Pro (P) ala ala 

Ser (S) thr thr 

Thr (T) ser ser 

Trp (W) tyr; phe tyr 

25 Tyr (Y) trp; phe; thr; ser phe 

Val (V) ile; leu; met; phe; 

ala; norleucine leu 



Substantial modifications in function or immunological identity of the PR0842 polypeptide are 
30 accomplished by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of 

the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the 

charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring 

residues are divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 
35 (2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

40 Non-conservative substitutions will entail exchanging a member of one of these classes for another class. 

Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into 
the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide -mediated (site- 
directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al., Nucl. 

45 Acids Res. , 13:4331 (1986); Zoller et al. , Nucl. Acids Res. . 10:6487 (1987)], cassette mutagenesis (Wells et al. , 
Gene , 34:315 [1985]), restriction selection mutagenesis (Wells et al., Philos. Trans. R. Soc. London Ser A , 
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317:415 [1986]) or other known techniques can be performed on the cloned DNA to produce the PR0842 variant 
DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such 
amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid 
5 among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main- 
chain conformation of the variant (Cunningham and Wells, Science , 244 : 1081-1085 [1989]). Alanine is also 
typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and 
exposed positions (Creighton, The Proteins , (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol. . 150 :1 
[1976]). If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 

10 

C. Modifications of PRQ842 
Covalent modifications of PR0842 are included within the scope of this invention. One type of covalent 
modification includes reacting targeted amino acid residues of a PR0842 polypeptide with an organic derivatizing 
agent that is capable of reacting with selected side chains or the N- or C- terminal residues of the PR0842. 

15 Derivatization with bifiinctional agents is useful, for instance, for crosslinking PR0842 to a water-insoluble 
support matrix or surface for use in the method for purifying anti-PR0842 antibodies, and vice- versa. Commonly 
used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide 
esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl 
esters such as 3,3*-ditMobis(succinimidylpropionate), bifiinctional maleimides such as bis-N-maleimido-l,8-octane 

20 and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryl or threonyl residues, methylation of the a-amino groups of lysine, arginine, and histidine side 
chains [T.E. Creighton, Proteins: Structure and Molecular Properties . W.H. Freeman & Co., San Francisco, 

25 pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the PR0842 polypeptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native glycosylation 
pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found in native 
sequence PR0842 (either by removing the underlying glycosylation site or by deleting the glycosylation by 

30 chemical and/or enzymatic means) , and/or adding one or more glycosylation sites that are not present in the native 
sequence PR0842. In addition, the phrase includes qualitative changes in the glycosylation of the native proteins, 
involving a change in the nature and proportions of the various carbohydrate moieties present. 

Addition of glycosylation sites to the PR0842 polypeptide may be accomplished by altering the amino 
acid sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more 

35 serine or threonine residues to the native sequence PR0842 (for O-linked glycosylation sites). The PR0842 
amino acid sequence may optionally be altered through changes at the DNA level, particularly by mutating the 
DNA encoding the PR0842 polypeptide at preselected bases such that codons are generated that will translate into 
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the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the PR0842 polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g., 
in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem. , pp. 259- 
306 (1981). 

5 Removal of carbohydrate moieties present on the PR0842 polypeptide may be accomplished chemically 

or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets 
for glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by 
Hakimuddin, et al., Arch. Biochem. Biophvs. . 259:52 (1987) and by Edge et al., Anal. Biochem. . 118:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of 
10 endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzymol. . 138 :350 (1987). 

Another type of covalent modification of PR0842 comprises linking the PR0842 polypeptide to one of 
a variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or 
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 
4,791,192 or 4,179,337. 

15 The PR0842 of the present invention may also be modified in a way to form a chimeric molecule 

comprising PR0842 fused to another, heterologous polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the PR0842 with a tag polypeptide 
which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed 
at the amino- or carboxyl- terminus of the PR0842. The presence of such epitope-tagged forms of the PR0842 

20 can be detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the 
PR0842 to be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix 
that binds to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the art. 
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide 
and its antibody 12CA5 [Field et al., Mol. Cell. Biol. , 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 

25 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology . 5:3610-3616 (1985)]; 
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al. , Protein Engineering , 
3£6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp etal.. BioTechnology . 6: 1204-1210 
(1988)]; the KT3 epitope peptide [Martin et al., Science . 255:192-194 (1992)]; an a-tubulin epitope peptide 
[Skinner et al., J. Biol. Chem. . 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz- 

30 Freyermuth et al. , Proc. Natl. Acad. Sci. USA . 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PR0842 with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also 
referred to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions 
preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PR0842 

35 polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred embodiment, 
the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 regions of an 
IgGl molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428, 130 issued June 27, 
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1995. 

In yet a further embodiment, the PR0842 polypeptides of the present invention may also be modified 
in a way to form a chimeric molecule comprising a PR0842 polypeptide fused to a leucine zipper. Various 
leucine zipper polypeptides have been described in the art. See, e.g., Landschulz et al., Science. 240:1759 
(1988); WO 94/10308; Hoppe et al, FEBS Letters. 344:1991 (1994); Maniatis et al, Nature. 341:24 (1989). 
It is believed that use of a leucine zipper fused to a PR0842 polypeptide may be desirable to assist in dimerizing 
or trimerizing soluble PR0842 polypeptide in solution. Those skilled in the art will appreciate that the leucine 
zipper may be fused at either the - or C-terminal end of the PR0842 molecule. 

D. Preparation of PRQ842 
The description below relates primarily to production of PR0842 by culturing cells transformed or 
transfected with a vector containing PR0842 nucleic acid: It is, of course, contemplated that alternative methods, 
which are well known in the art, may be employed to prepare PR0842. For instance, the PR0842 sequence, or 
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, e.g. , Stewart et 
al., Solid-Phase Peptide Synthesis . W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J. Am. Chem. 
Soc , 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be accomplished, for instance, using an Applied Biosystems Peptide 
Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PR0842 may be 
chemically synthesized separately and combined using chemical or enzymatic methods to produce the full-length 
PR0842. 

1. Isolation of DNA Encoding PRQ842 

DNA encoding PR0842 may be obtained from a cDNA library prepared from tissue believed to possess 
the PR0842 mRNA and to express it at a detectable level. Accordingly, human PR0842 DNA can be 
conveniently obtained from a cDNA library prepared from human tissue, such as described in the Examples. The 
PR0842-encoding gene may also be obtained from a genomic library or by known synthetic procedures (e.g., 
automated nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PR0842 or oligonucleotides of at least 
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA 
or genomic library with the selected probe may be conducted using standard procedures, such as described in 
Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 
1989). An alternative means to isolate the gene encoding PR0842 is to use PCR methodology [Sambrook et al. , 
supra; Dieffenbach et al, PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)]. 

The Examples below describe techniques for screening a cDNA library. The oligonucleotide sequences 
selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library 
being screened. Methods of labeling are well known in the art, and include the use of radiolabels like 32 P-labeled 
ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high 
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stringency, are provided in Sambrook et al., supra. 

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the art and as described herein. 
5 Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 

libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al., supra, to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 

10 2. Selection and Transformation of Host Cells 

Host cells are transfected or transformed with expression or cloning vectors described herein for PR0842 
production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting 
transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, 
temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general, 

15 principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in 
Mammalian Cell Biotechnology: a Practical Approach . M. Butler, ed. (IRL Press, 1991) and Sambrook etal., 
supra. 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily 
skilled artisan, for example, CaCl 2 , CaP0 4 , liposome-mediated and electroporation. Depending on the host cell 

20 used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment 
employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for 
prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as 
described by Shaw et al., Gene , 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells 
without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology , 52:456- 

25 457 (1978) can be employed. General aspects of mammalian cell host system transfections have been described 
in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the method of 
Van Solingen etal, J. Bact. . 130:946 (1977) and Hsiao etal., Proc. Natl. Acad. Sci. (USA) . 76:3829 (1979). 
However, other methods for introducing DNA into cells, such as by nuclear microinjection, electroporation, 
bacterial protoplast fusion with intact cells, or polycations, e.g. , polybrene, polyornithine, may also be used. For 

30 various techniques for transforming mammalian cells, see Keown et al., Methods in Enzvmology , 185 :527-537 
(1990) and Mansour etal, Nature , 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, 
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative 
or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are publicly 

35 available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli strain 
W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include 
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, 
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e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as 5. 
subtilis and 5. licheniformis {e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 April 1989), 
Pseudomonas such as P. aeruginosa, and Streptomyces . These examples are illustrative rather than limiting. 
Strain W31 10 is one particularly preferred host or parent host because it is a common host strain for recombinant 
DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For 
5 example, strain W31 10 may be modified to effect a genetic mutation in the genes encoding proteins endogenous 
to the host, with examples of such hosts including E. coli W3110 strain 1A2, which has the complete genotype 
tonA ; E. coli W31 10 strain 9E4, which has the complete genotype tonA ptr3\ E. coli W31 10 strain 27C7 (ATCC 
55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompTkan r ; E. coli W31 10 
strain 37D6, which has the complete genotype tonA ptr3 phoA El 5 (argF-lac)169 degP ompT rbs7 ilvG kan r ; 

10 E. coli W31 10 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion mutation; and an 
E. coli strain having mutant periplasmic protease disclosed in U.S. Patent No. 4,946,783 issued 7 August 1990. 
Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning 
or expression hosts for PR0842-encoding vectors. Saccharomyces cerevisiae is a commonly used lower 

15 eukaryotic host microorganism. Others include Schizosaccharomyces pombe [Beach and Nurse, Nature , 290 : 140 
(1981); EP 139,383 published 2 May 1985]; Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al., 
Bio/Technoloev . 9:968-975 [1991]) such as, e.g. , K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et al. , 
J.Bacteriol. . 154(2) : 737-742 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K. wickeramii 
(ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al., 

20 Bio/Technology . 8:135 [1990]), K. thermotolerans, and K. marxianus; yarrowia (EP 402,226); Pichia pastoris 
(EP 183,070; Sreekrishna et al., J. Basic Microbiol. . 28:265-278 [1988]); Candida; Trichoderma reesia (EP 
244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA . 76:5259-5263 [1979]); Schwanniomyces 
such as Schwanniomyces occidentalis (EP 394,538 published 31 October 1990); and filamentous fungi such as, 
e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 January 1991), and Aspergillus hosts 

25 such as A. nidulans (Ballance et al., Biochem. Biophvs. Res. Commun.. 112:284-289 [1983]; Tilburn et al., 
Gene . 26:205-221 [1983]; Yeltonefa/., Proc. Natl. Acad. Sci. USA . 81:1470-1474 [1984]) and A. niger (Kelly 
and Hynes, EMBO J. . 4:475-479 [1985]). Methylotropic yeasts are suitable herein and include, but are not 
limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, Candida, 
Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that are exemplary 

30 of this class of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrophs . 269 (1982). 

Suitable host cells for the expression of glycosylated PR0842 are derived from multicellular organisms. 
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9 or Spodoptera High 
5 cells, as well as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary 
(CHO) and COS cells. More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, 

35 ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, 
Graham et al., J. Gen Virol. . 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, 
Proc. Natl. Acad. Sci. USA . 77:4216 [1980]); mouse Sertoli cells (TM4, Mather, Biol. Reprod. . 23:243-251 
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[1980]); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary 
tumor (MMT 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill 
in the art. 



3. Selection and Use of a Replicable Vector 

5 The nucleic acid (e.g., cDNA or genomic DNA) encoding PR0842 may be inserted into a replicable 

vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The 
vector may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic 
acid sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an 
appropriate restriction endonuclease site(s) using techniques known in the art. Vector components generally 
10 include, but are not limited to, one or more of a signal sequence, an origin of replication, one or more marker 
genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable 
vectors containing one or more of these components employs standard ligation techniques which are known to the 
skilled artisan. 

The PR0842 may be produced recombinantly not only directly, but also as a fusion polypeptide with a 

15 heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of 
the vector, or it may be a part of the PR0842-encoding DNA that is inserted into the vector. The signal sequence 
may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, 
penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g. , the 

20 yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces oc-f actor leaders, the latter 
described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 
362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November 1990. In 
mammalian cell expression, mammalian signal sequences may be used to direct secretion of the protein, such as 
signal sequences from secreted polypeptides of the same or related species, as well as viral secretory leaders. 

25 Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 

in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2fi plasmid 
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 
cloning vectors in mammalian cells. 

30 Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 

Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g. , ampicillin, 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients 
not available from complex media, e.g. , the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable the identification 

35 of cells competent to take up the PR0842-encoding nucleic acid, such as DHFR or thymidine kinase. An 
appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared 
and propagated as described by Urlaub etal., Proc. Natl. Acad. Sci. USA . 77:4216 (1980). A suitable selection 





gene for use in yeast is the //pi gene present in the yeast plasmid YRp7 [Stinchcomb et aL, Nature , 282:39 
(1979); Kingsman etaL, Gene . 7:141 (1979); Tschemper etaL, Gene . 10:157 (1980)]. The trp\ gene provides 
a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 
44076 or PEP4-1 [Jones, Genetics . 85:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the PR0842-encoding 
5 nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are 
well known. Promoters suitable for use with prokaryotic hosts include the P-lactamase and lactose promoter 
systems [Chang etaL, Nature . 275:615 (1978); Goeddel etaL, Nature . 281:544 (1979)], alkaline phosphatase, 
a tryptophan (trp) promoter system [Goeddel, Nucleic Acids Res. . 8:4057 (1980); EP 36,776], and hybrid 
promoters such as the tac promoter [deBoer et aL, Proc. Natl. Acad. Sci. USA . 80:21-25 (1983)]. Promoters 
10 for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked to the DNA 
encoding PR0842. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphogiycerate kinase [Hitzeman etaL, J. Biol. Chem. . 255:2073 (1980)] or other glycolytic enzymes [Hess 
et aL, J. Adv. Enzyme Reg. . 7:149 (1968); Holland, Biochemistry . 17:4900 (1978)], such as enolase, 

15 glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucoses- 
phosphate isomerase, 3 -phosphogiycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomer ase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 

20 phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast expression are further described in EP 73,657. 

PR0842 transcription from vectors in mammalian host cells is controlled, for example, by promoters 
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July 

25 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g. , the 
actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are 
compatible with the host cell systems. 



30 enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, 
that act on a promoter to increase its transcription. Many enhancer sequences are now known from mammalian 
genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an enhancer from 
a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 100- 
270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication 

35 origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5' or 3' to the 
PR0842 coding sequence, but is preferably located at a site 5' from the promoter. 



Transcription of a DNA encoding the PR0842 by higher eukaryotes may be increased by inserting an 



Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 
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cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription 
and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, occasionally 3', 
untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments 
transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PR0842. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PR0842 in 
5 recombinant vertebrate cell culture are described in Gething et al., Nature . 293:620-625 (1981); Mantei et al, 
Nature . 281:40-46 (1979); EP 117,060; and EP 117,058. 

4. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA (Thomas, Proc. Natl. 
Acad. Sci. USA . 77:5201-5205 [1980]), dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed 
that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes 
or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out where the 
duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound 
to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the 
antibodies may be prepared against a native sequence PR0842 polypeptide or against a synthetic peptide based 
on the DNA sequences provided herein or against exogenous sequence fused to PR0842 DNA and encoding a 
specific antibody epitope. 

25 5. Purification of Polypeptide 

Forms of PR0842 may be recovered from culture medium or from host cell lysates. If membrane- 
bound, it can be released from the membrane using a suitable detergent solution {e.g., Triton-X 100) or by 
enzymatic cleavage. Cells employed in expression of PR0842 can be disrupted by various physical or chemical 
means, such as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents. 

30 It may be desired to purify PR0842 from recombinant cell proteins or polypeptides. The following 

procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; ethanol 
precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as DEAE; 
chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, SephadexG-75; 
Protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns to bind epitope- 

35 tagged forms of the PR0842. Various methods of protein purification may be employed and such methods are 
known in the art and described for example in Deutscher, Methods in Enzvmologv . 182 (1990); Scopes, Protein 
Purification: Principles and Practice . Springer- Verlag, New York (1982). The purification step(s) selected will 
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depend, for example, on the nature of the production process used and the particular PR0842 produced. 

E. Uses for PRQ842 

Nucleotide sequences (or their complement) encoding PR0842 have various applications in the art of 
molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the generation 
5 of anti-sense RNA and DNA. PR0842 nucleic acid will also be useful for the preparation of PR0842 
polypeptides by the recombinant techniques described herein. 

The full-length native sequence PR0842 gene, or portions thereof, may be used as hybridization probes 
for a cDNA library to isolate the full-length PR0842 cDNA or to isolate still other cDNAs (for instance, those 
encoding naturally-occurring variants of PR0842 or PR0842 from other species) which have a desired sequence 
10 identity to the native PR0842 sequence disclosed herein. Optionally, the length of the probes will be about 20 to 
about 50 bases. The hybridization probes may be derived from at least partially novel regions of the full length 
native nucleotide sequence wherein those regions may be determined without undue experimentation or from 
genomic sequences including promoters, enhancer elements and introns of native sequence PR0842. By way of 
example, a screening method will comprise isolating the coding region of the PR0842 gene using the known DNA 

15 sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of 
labels, including radionucleotides such as 32 P or 35 S, or enzymatic labels such as alkaline phosphatase coupled to 
the probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the 
PR0842 gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA 
to determine which members of such libraries the probe hybridizes to. Hybridization techniques are described 

20 in further detail in the Examples below. 

Any EST sequences disclosed in the present application may similarly be employed as probes, using the 
methods disclosed herein. 

Other useful fragments of the PR0842 nucleic acids include antisense or sense oligonucleotides 
comprising a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target PR0842 

25 mRNA (sense) or PR0842 DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the 
present invention, comprise a fragment of the coding region of PR0842 DNA. Such a fragment generally 
comprises at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an 
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for 
example, Stein and Cohen ( Cancer Res. 48:2659, [1988]) and van der Krol et ah (BioTechniques. 6 :958. [1988]). 

30 Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation 

of duplexes that block transcription or translation of the target sequence by one of several means, including 
enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. 
The antisense oligonucleotides thus may be used to block expression of PR0842 proteins. Antisense or sense 
oligonucleotides further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other 

35 sugar linkages, such as those described in WO 91/06629) and wherein such sugar linkages are resistant to 
endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of 
resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide sequences. 
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Other examples of sense or antisense oligonucleotides include those oligonucleotides which are covalently 
linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases affinity 
of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, intercalating 
agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or antisense 
oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target nucleotide 
5 sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example, CaP0 4 -mediated DNA transfection, 
electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an 
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target nucleic 
10 acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral 
vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO 
90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
15 sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule or 
receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell. 
20 Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target 

nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The sense 
or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase. 

Antisense or sense RNA or DNA molecules are generally at least about 5 bases in length, about 10 bases 
in length, about 15 bases in length, about 20 bases in length, about 25 bases in length, about 30 bases in length, 
25 about 35 bases in length, about 40 bases in length, about 45 bases in length, about 50 bases in length, about 55 
bases in length, about 60 bases in length, about 65 bases in length, about 70 bases in length, about 75 bases in 
length, about 80 bases in length, about 85 bases in length, about 90 bases in length, about 95 bases in length, 
about 100 bases in length, or more. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification 
30 of closely related PR0842 coding sequences. 

Nucleotide sequences encoding a PR0842 can also be used to construct hybridization probes for mapping 
the gene which encodes that PR0842 and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, and 
35 hybridization screening with libraries. 

When the coding sequences for PR0842 encode a protein which binds to another protein (example, where 
the PR0842 is a receptor), the PR0842 can be used in assays to identify the other proteins or molecules involved 



in the binding interaction. By such methods, inhibitors of the receptor/1 igand binding interaction can be identified, 
proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors 
or agonists of the binding interaction. Also, the receptor PR0842 can be used to isolate correlative ligand(s). 
Screening assays can be designed to find lead compounds that mimic the biological activity of a native PR0842 
or a receptor for PR0842. Such screening assays will include assays amenable to high-throughput screening of 
5 chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and cell 
based assays, which are well characterized in the art. 

Nucleic acids which encode PR0842 or its modified forms can also be used to generate either transgenic 

10 animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically 
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, 
which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic 
stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops. In one embodiment, cDNA encoding PR0842 can be used to clone genomic DNA encoding PR0842 

15 in accordance with established techniques and the genomic sequences used to generate transgenic animals that 
contain cells which express DNA encoding PR0842. Methods for generating transgenic animals, particularly 
animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patent 
Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for PR0842 transgene incorporation 
with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding PR0842 

20 introduced into the germ line of the animal at an embryonic stage can be used to examine the effect of increased 
expression of DNA encoding PR0842. Such animals can be used as tester animals for reagents thought to confer 
protection from, for example, pathological conditions associated with its overexpression. In accordance with this 
facet of the invention, an animal is treated with the reagent and a reduced incidence of the pathological condition, 
compared to untreated animals bearing the transgene, would indicate a potential therapeutic intervention for the 

25 pathological condition. 

Alternatively, non-human homologues of PR0842 can be used to construct a PR0842 "knockout" animal 
which has a defective or altered gene encoding PR0842 as a result of homologous recombination between the 
endogenous gene encoding PR0842 and altered genomic DNA encoding PR0842 introduced into an embryonic 
stem cell of the animal. For example, cDNA encoding PR0842 can be used to clone genomic DNA encoding 

30 PR0842. in accordance with established techniques. A portion of the genomic DNA encoding PR0842 can be 
deleted or replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor 
integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included in 
the vector [see e.g., Thomas and Capecchi, Cell , 51:503 (1987) for a description of homologous recombination 
vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which 

35 the introduced DNA has homologously recombined with the endogenous DNA are selected [see, e.g. , Li et aL, 
Cell , 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., a mouse or rat) 
to form aggregation chimeras [see, e.g. , Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical 
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Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152]. A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and the embryo brought to term to create a "knock out" 
animal. Progeny harboring the homologously recombined DNA in their germ cells can be identified by standard 
techniques and used to breed animals in which all cells of the animal contain the homologously recombined DNA. 
Knockout animals can be characterized for instance, for their ability to defend against certain pathological 
5 conditions and for their development of pathological conditions due to absence of the PR0842 polypeptide. 

Nucleic acid encoding the PR0842 polypeptides may also be used in gene therapy. In gene therapy 
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective 
genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional 
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic 
10 agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA. 
Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in vivo. 
It has already been shown that short antisense oligonucleotides can be imported into cells where they act as 
inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell membrane. 
(Zamecnik et al. , Proc. Natl. Acad. Sci. USA . 83:4143-4146 [1986]). The oligonucleotides can be modified to 

15 enhance their uptake, e.g. , by substituting their negatively charged phosphodiester groups by uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The techniques 
vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in the cells of 
the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro include the 
use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium phosphate precipitation 

20 method, etc. The currently preferred in vivo gene transfer techniques include transfection with viral (typically 
retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau et aL, Trends in Biotechnology . 
U.: 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with an agent that targets 
the target cells, such as an antibody specific for a cell surface membrane protein or the target cell, a ligand for 
a receptor on the target cell, etc. Where liposomes are employed, proteins which bind to a cell surface membrane 

25 protein associated with endocytosis may be used for targeting and/or to facilitate uptake, e.g. , capsid proteins or 
fragments thereof tropic for a particular cell type, antibodies for proteins which undergo internalization in cycling, 
proteins that target intracellular localization and enhance intracellular half-life. The technique of receptor- 
mediated endocytosis is described, for example, by Wu et al., J. Biol. Chem. , 262 : 4429-4432 (1987); and 
Wagner et al. , Proc. Natl. Acad. Sci. USA . 87: 3410-3414 (1990). For review of gene marking and gene therapy 

30 protocols see Anderson et al., Science , 256 : 808-813 (1992). 

The PR0842 polypeptides described herein may also be employed as molecular weight markers for 
protein electrophoresis purposes and the isolated nucleic acid sequences may be used for recombinantly expressing 
those markers. 

The nucleic acid molecules encoding the PR0842 polypeptides or fragments thereof described herein are 
35 useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 
markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each PR0842 nucleic acid molecule of the present invention can be used as a chromosome marker. 
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The PR0842 polypeptides and nucleic acid molecules of the present invention may also be used 
diagnostically for tissue typing, wherein the PR0842 polypeptides of the present invention may be differentially 
expressed in one tissue as compared to another, preferably in a diseased tissue as compared to a normal tissue of 
the same tissue type. PR0842 nucleic acid molecules will find use for generating probes for PCR, Northern 
analysis, Southern analysis and Western analysis. 
5 The PR0842 polypeptides described herein may also be employed as therapeutic agents. The PR0842 

polypeptides of the present invention can be formulated according to known methods to prepare pharmaceutically 
useful compositions, whereby the PR0842 product hereof is combined in admixture with a pharmaceutically 
acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the active ingredient 
having the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers 

10 (Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of lyophilized formulations 
or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages and 
concentrations employed, and include buffers such as phosphate, citrate and other organic acids; antioxidants 
including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum 
albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino acids such as 

15 glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other carbohydrates 
including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or 
sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, PLURONICS™ 
or PEG. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
20 filtration through sterile filtration membranes, prior to or following lyophilization and reconstitution. 

Therapeutic compositions herein generally are placed into a container having a sterile access port, for 
example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. , injection or infusion by intravenous, 
intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical 
25 administration, or by sustained release systems. 

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may 
vary depending on the particular use envisioned. The determination of the appropriate dosage or route of 
administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance 
for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be 
30 performed following the principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling 
in toxicokinetics" In Toxicokinetics and New Drug Development , Yacobi et al. 9 Eds., Pergamon Press, New 
York 1989, pp. 42-96. 

When in vivo administration of a PR0842 polypeptide or agonist or antagonist thereof is employed, 
normal dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per 
35 day, preferably about 1 ptg/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance as 
to particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos. 
4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different 



treatment compounds and different disorders, that administration targeting one organ or tissue, for example, may 
necessitate delivery in a manner different from that to another organ or tissue. 

Where sustained-release administration of a PR0842 polypeptide is desired in a formulation with release 
characteristics suitable for the treatment of any disease or disorder requiring administration of the PR0842 
polypeptide, microencapsulation of the PR0842 polypeptide is contemplated. Microencapsulation of recombinant 
proteins for sustained release has been successfully performed with human growth hormone (rhGH), interferon- 
(rhlFN- ), interleukin-2, and MN rgpl20. Johnson et al. , Nat. Med. . 2:795-799 (1996); Yasuda, Biomed. Ther. . 
27: 1221-1223 (1993); Hora et al. , Bio/Technology. 8:755-758 (1990); Cleland, "Design and Production of Single 
Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design: The Subunit 
and Adjuvant Approach . Powell and Newman, eds, (Plenum Press: New York, 1995), pp. 439-462; WO 
97/03692, WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010. 

The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid 
(PLGA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation products 
of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the degradability 
of this polymer can be adjusted from months to years depending on its molecular weight and composition. Lewis, 
"Controlled release of bioactive agents from lactide/glycolide polymer," in: M. Chasin and R. Langer (Eds.), 
Biodegradable Polymers as Drug Delivery Systems (Marcel Dekker: New York, 1990), pp. 1-41. 

This invention encompasses methods of screening compounds to identify those that mimic the PR0842 
polypeptide (agonists) or prevent the effect of the PR0842 polypeptide (antagonists). Screening assays for 
antagonist drug candidates are designed to identify compounds that bind or complex with the PR0842 polypeptides 
encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides with 
other cellular proteins. Such screening assays will include assays amenable to high-throughput screening of 
chemical libraries, making them particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 

All assays for antagonists are common in that they call for contacting the drug candidate with a PR0842 
polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the PR0842 polypeptide encoded by the gene identified herein or 
the drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of 
the PR0842 polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, 
specific for the PR0842 polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is 
performed by adding the non-immobilized component, which may be labeled by a detectable label, to the 
immobilized component, e.g. , the coated surface containing the anchored component. When the reaction is 
complete, the non-reacted components are removed, e.g., by washing, and complexes anchored on the solid 
surface are detected. When the originally non-immobilized component carries a detectable label, the detection 
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of label immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labeled antibody 
specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular PR0842 polypeptide encoded 
by a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for 
detecting protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, 
co-immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 
(Fields and Song, Nature (London) . 340:245-246 [1989]); Chien et aL, Proc. Natl. Acad. Sci. USA . 88:9578- 
9582 [1991]) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA . 89:5789-5793 (1991). Many 
transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting 
as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast 
expression system described in the foregoing publications (generally referred to as the "two-hybrid system") takes 
advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to the DNA- 
binding domain of GAL4, and another, in which candidate activating proteins are fused to the activation domain. 
The expression of a GALl-/acZ reporter gene under control of a GAL4-activated promoter depends on 
reconstitution of GAL4 activity via protein-protein interaction. Colonies containing interacting polypeptides are 
detected with a chromogenic substrate for P-galactosidase. A complete kit (MATCHMAKER™) for identifying 
protein-protein interactions between two specific proteins using the two-hybrid technique is commercially available 
from Clontech. This system can also be extended to map protein domains involved in specific protein interactions 
as well as to pinpoint amino acid residues that are crucial for these interactions. 

Compounds that interfere with the interaction of a gene encoding a PR0842 polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
containing the product of the gene and the intra- or extracellular component under conditions and for a time 
allowing for the interaction and binding of the two products. To test the ability of a candidate compound to inhibit 
binding, the reaction is run in the absence and in the presence of the test compound. In addition, a placebo may 
be added to a third reaction mixture, to serve as positive control. The binding (complex formation) between the 
test compound and the intra- or extracellular component present in the mixture is monitored as described 
hereinabove. The formation of a complex in the control reaction(s) but not in the reaction mixture containing the 
test compound indicates that the test compound interferes with the interaction of the test compound and its reaction 
partner. 

To assay for antagonists, the PR0842 polypeptide may be added to a cell along with the compound to 
be screened for a particular activity and the ability of the compound to inhibit the activity of interest in the 
presence of the PR0842 polypeptide indicates that the compound is an antagonist to the PR0842 polypeptide. 
Alternatively, antagonists may be detected by combining the PR0842 polypeptide and a potential antagonist with 
membrane-bound PR0842 polypeptide receptors or recombinant receptors under appropriate conditions for a 
competitive inhibition assay. The PR0842 polypeptide can be labeled, such as by radioactivity, such that the 
number of PR0842 polypeptide molecules bound to the receptor can be used to determine the effectiveness of the 
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potential antagonist. The gene encoding the receptor can be identified by numerous methods known to those of 
skill in the art, for example, ligand panning and FACS sorting. Coligan et aL, Current Protocols in Immun. , 
1(2) : Chapter 5 (1991). Preferably, expression cloning is employed wherein polyadenylated RNA is prepared 
from a cell responsive to the PR0842 polypeptide and a cDNA library created from this RNA is divided into pools 
and used to transfect COS cells or other cells that are not responsive to the PR0842 polypeptide. Transfected 
5 cells that are grown on glass slides are exposed to labeled PR0842 polypeptide. The PR0842 polypeptide can 
be labeled by a variety of means including iodination or inclusion of a recognition site for a site-specific protein 
kinase. Following fixation and incubation, the slides are subjected to autoradiographic analysis. Positive pools 
are identified and sub-pools are prepared and re-transfected using an interactive sub-pooling and re-screening 
process, eventually yielding a single clone that encodes the putative receptor. 

10 As an alternative approach for receptor identification, labeled PR0842 polypeptide can be photoaffinity- 

linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is 
resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 

15 library to identify the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 
would be incubated with labeled PR0842 polypeptide in the presence of the candidate compound. The ability of 
the compound to enhance or block this interaction could then be measured. 

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 

20 immunoglobulin with PR0842 polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric 
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PR0842 
polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 

25 PR0842 polypeptide. 

Another potential PR0842 polypeptide antagonist is an antisense RNA or DNA construct prepared using 
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used 
to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are 

30 based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes the mature PR0842 polypeptides herein, is used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
complementary to a region of the gene involved in transcription (triple helix - see Lee et aL , Nucl. Acids Res. . 
6:3073 (1979); Cooney et aL, Science , 241:456 (1988); Dervan et aL, Science , 251:1360 (1991)), thereby 

35 preventing transcription and the production of the PR0842 polypeptide. The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the PR0842 polypeptide 
(antisense - Okano, Neurochem. . 56:560 (1991); Oligodeoxvnucleotides as Antisense Inhibitors of Gene 



Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered to 
cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the PR0842 
polypeptide. When antisense DNA is used, oligodeoxy ribonucleotides derived from the translation-initiation site, 
e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or 
5 growth factor or other relevant binding site of the PR0842 polypeptide, thereby blocking the normal biological 
activity of the PR0842 polypeptide. Examples of small molecules include, but are not limited to, small peptides 
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic compounds. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by endonucleolytic 
10 cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by known techniques. 
For further details see, e.g. , Rossi, Current Biology , 4:469-471 (1994), and PCT publication No. WO 97/33551 
(published September 18, 1997). 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 
and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it 
15 promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
97/33551, supra. 

These small molecules can be identified by any one or more of the screening assays discussed 
hereinabove and/or by any other screening techniques well known for those skilled in the art. 
20 Diagnostic and therapeutic uses of the herein disclosed molecules may also be based upon the positive 

functional assay hits disclosed and described below 

F. Tissue Distribution 

The location of tissues expressing the PR0842 can be identified by determining mRNA expression in 
25 various human tissues. The location of such genes provides information about which tissues are most likely to 
be affected by the stimulating and inhibiting activities of the PR0842 polypeptides. The location of a gene in 
a specific tissue also provides sample tissue for the activity blocking assays discussed below. 

As noted before, gene expression in various tissues may be measured by conventional Southern blotting, 
Northern blotting to quantitate the transcription of mRNA (Thomas, Proc. Natl. Acad. Sci. USA . 77:5201-5205 
30 [1980]), dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled probe, based on the 
sequences provided herein. Alternatively, antibodies may be employed that can recognize specific duplexes, 
including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. 

Gene expression in various tissues, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate directly the 
35 expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of sample fluids 
may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may 
be prepared against a native sequence of a PR0842 polypeptide or against a synthetic peptide based on the DNA 
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sequences encoding the PR0842 polypeptide or against an exogenous sequence fused to a DNA encoding a 
PR0842 polypeptide and encoding a specific antibody epitope. General techniques for generating antibodies, and 
special protocols for Northern blotting and in situ hybridization are provided below. 

G. Antibody Binding Studies 

5 The activity of the PRO 842 polypeptides can be further verified by antibody binding studies, in which 

the ability of anti-PR0842 antibodies to inhibit the effect of the PR0842 polypeptides, respectively, on tissue cells 
is tested. Exemplary antibodies include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate 
antibodies, the preparation of which will be described hereinbelow. 

Antibody binding studies may be carried out in any known assay method, such as competitive binding 
10 assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola, Monoclonal Antibodies: A 
Manual of Techniques , pp. 147-158 (CRC Press, Inc., 1987). 

Competitive binding assays rely on the ability of a labeled standard to compete with the test sample 
analyte for binding with a limited amount of antibody. The amount of target protein in the test sample is inversely 
proportional to the amount of standard that becomes bound to the antibodies. To facilitate determining the amount 
15 of standard that becomes bound, the antibodies preferably are insolubilized before or after the competition, so that 
the standard and analyte that are bound to the antibodies may conveniently be separated from the standard and 
analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different immunogenic 
portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte is bound by a first 
20 antibody which is immobilized on a solid support, and thereafter a second antibody binds to the analyte, thus 
forming an insoluble three-part complex. See, e.g., US Pat No. 4,376,110. The second antibody may itself be 
labeled with a detectable moiety (direct sandwich assays) or may be measured using an anti-immunoglobulin 
antibody that is labeled with a detectable moiety (indirect sandwich assay). For example, one type of sandwich 
assay is an ELISA assay, in which case the detectable moiety is an enzyme. 
25 For immunohistochemistry, the tissue sample may be fresh or frozen or may be embedded in paraffin and fixed 
with a preservative such as formalin, for example. 

H. Cell-Based Assays 

Cell-based assays and animal models for immune related diseases can be used to further understand the 
30 relationship between the genes and polypeptides identified herein and the development and pathogenesis of 
immune related disease. 

In a different approach, cells of a cell type known to be involved in a particular immune related disease 
are transfected with the cDNAs described herein, and the ability of these cDNAs to stimulate or inhibit immune 
function is analyzed. Suitable cells can be transfected with the desired gene, and monitored for immune function 
35 activity. Such transfected cell lines can then be used to test the ability of poly- or monoclonal antibodies or 
antibody compositions to inhibit or stimulate immune function, for example to modulate T-cell proliferation or 
inflammatory cell infiltration. Cells transfected with the coding sequences of the genes identified herein can 
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further be used to identify drug candidates for the treatment of immune related diseases. 

In addition, primary cultures derived from transgenic animals (as described below) can be used in the 
cell-based assays herein, although stable cell lines are preferred. Techniques to derive continuous cell lines from 
transgenic animals are well known in the art (see, e.g., Small et al., Mol. Cell. Biol.. 5: 642-648 [1985]). 

One suitable cell based assay is the mixed lymphocyte reaction (MLR). Current Protocols in 
5 Immunology , unit 3.12; edited by J E Coligan, A M Kruisbeek, D H Marglies, E M Shevach, W Strober, 
National Institutes of Health, Published by John Wiley & Sons, Inc. In this assay, the ability of a test compound 
to stimulate or inhibit the proliferation of activated T cells is assayed. A suspension of responder T cells is 
cultured with allogeneic stimulator cells and the proliferation of T cells is measured by uptake of tritiated 
thymidine. This assay is a general measure of T cell reactivity. Since the majority of T cells respond to and 

10 produce IL-2 upon activation, differences in responsiveness in this assay in part reflect differences in IL-2 
production by the responding cells. The MLR results can be verified by a standard lymphokine (IL-2) detection 
assay. Current Protocols in Immunology , above, 3.15, 6.3. 

A proliferative T cell response in an MLR assay may be due to direct mitogenic properties of an assayed 
molecule or to external antigen induced activation. Additional verification of the T cell stimulatory activity of 

15 the PR0842 polypeptides can be obtained by a costimulation assay. T cell activation requires an antigen specific 
signal mediated through the T-cell receptor (TCR) and a costimulatory signal mediated through a second ligand 
binding interaction, for example, the B7 (CD80, CD86)/CD28 binding interaction. CD28 crosslinking increases 
lymphokine secretion by activated T cells. T cell activation has both negative and positive controls through the 
binding of ligands which have a negative or positive effect. CD28 and CTLA-4 are related glycoproteins in the 

20 Ig superfamily which bind to B7. CD28 binding to B7 has a positive costimulation effect of T cell activation; 
conversely, CTLA-4 binding to B7 has a negative T cell deactivating effect. Chambers, C. A. and Allison, J. P.. 
Curr. Opin. Immunol., (1997)_9:396. Schwartz, R. H., Cell (1992) 71:1065; Linsley, P. S. and Ledbetter, J. 
A., Annu. Rev. Immunol. (1993) 11:191; June, C. H. etal, Immunol. Today (1994) 15:321; Jenkins, M. K., 
Immunity (1994) 1:405. In a costimulation assay, the PR0842 polypeptides are assayed for T cell costimulatory 

25 or inhibitory activity. 

PR0842 polypeptides, as well as other compounds of the invention, which are stimulators (costimulators) 
of T cell proliferation and agonists, e.g., agonist antibodies, thereto as determined by MLR and costimulation 
assays, for example, are useful in treating immune related diseases characterized by poor, suboptimal or 
inadequate immune function. These diseases are treated by stimulating the proliferation and activation of T cells 

30 (and T cell mediated immunity) and enhancing the immune response in a mammal through administration of a 
stimulatory compound, such as the stimulating PR0842 polypeptides. The stimulating polypeptide may, for 
example, be a PR0842 polypeptide or an agonist antibody thereof. 

Direct use of a stimulating compound as in the invention has been validated in experiments with 4- IBB 
glycoprotein, a member of the tumor necrosis factor receptor family, which binds to a ligand (4-1BBL) expressed 

35 on primed T cells and signals T cell activation and growth. Alderson, M. E. et al. , J. Immunol., 24:2219 (1994). 

The use of an agonist stimulating compound has also been validated experimentally. Activation of 
4-1BB by treatment with an agonist anti-4-lBB antibody enhances eradication of tumors. Hellstrom, I. and 
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Hellstrom, K. E.. Crit. Rev. Immunol.. 18:1 (1998). Immunoadjuvant therapy for treatment of tumors, described 
in more detail below, is another example of the use of the stimulating compounds of the invention. 
An immune stimulating or enhancing effect can also be achieved by antagonizing or blocking the activity of a 
PR0842 which has been found to be inhibiting in the MLR assay. Negating the inhibitory activity of the 
compound produces a net stimulatory effect. Suitable antagonists/blocking compounds are antibodies or fragments 
5 thereof which recognize and bind to the inhibitory protein, thereby blocking the effective interaction of the protein 
with its receptor and inhibiting signaling through the receptor. This effect has been validated in experiments using 
anti-CTLA-4 antibodies which enhance T cell proliferation, presumably by removal of the inhibitory signal caused 
by CTLA-4 binding. Walunas, T. L. et al., Immunity. 1:405 (1994). 

Alternatively, an immune stimulating or enhancing effect can also be achieved by administration of a 
10 PR0842 polypeptide which has vascular permeability enhancing properties. Enhanced vacuolar permeability 
would be beneficial to disorders which can be attenuated by local infiltration of immune cells {e.g., monocytes, 
eosinophils, PMNs) and inflammation. 

On the other hand, PR0842 polypeptides, as well as other compounds of the invention, which are direct 
inhibitors of T cell proliferation/activation, lymphokine secretion, and/or vascular permeability can be directly 
15 used to suppress the immune response. These compounds are useful to reduce the degree of the immune response 
and to treat immune related diseases characterized by a hyperactive, superoptimal, or autoimmune response. This 
use of the compounds of the invention has been validated by the experiments described above in which CTLA-4 
binding to receptor B7 deactivates T cells. The direct inhibitory compounds of the invention function in an 
analogous manner. The use of compound which suppress vascular permeability would be expected to reduce 
20 inflammation. Such uses would be beneficial in treating conditions associated with excessive inflammation. 

Alternatively, compounds, e.g., antibodies, which bind to stimulating PR0842 polypeptides and block 
the stimulating effect of these molecules produce a net inhibitory effect and can be used to suppress the T cell 
mediated immune response by inhibiting T cell proliferation/activation and/or lymphokine secretion. Blocking 
the stimulating effect of the polypeptides suppresses the immune response of the mammal. This use has been 
25 validated in experiments using an anti-IL2 antibody. In these experiments, the antibody binds to IL2 and blocks 
binding of IL2 to its receptor thereby achieving a T cell inhibitory effect. 

I. Animal Models 

The results of the cell based in vitro assays can be further verified using in vivo animal models and assays 
30 for T-cell function. A variety of well known animal models can be used to further understand the role of the 
genes identified herein in the development and pathogenesis of immune related disease, and to test the efficacy 
of candidate therapeutic agents, including antibodies, and other antagonists of the native polypeptides, including 
small molecule antagonists. The in vivo nature of such models makes them predictive of responses in human 
patients. Animal models of immune related diseases include both non-recombinant and recombinant (transgenic) 
35 animals. Non-recombinant animal models include, for example, rodent, e.g., murine models. Such models can 
be generated by introducing cells into syngeneic mice using standard techniques, e.g., subcutaneous injection, 
tail vein injection, spleen implantation, intraperitoneal implantation, implantation under the renal capsule, etc. 
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Graft- versus-host disease occurs when immunocompetent cells are transplanted into immunosuppressed 
or tolerant patients. The donor cells recognize and respond to host antigens. The response can vary from life 
threatening severe inflammation to mild cases of diarrhea and weight loss. Graft- versus-host disease models 
provide a means of assessing T cell reactivity against MHC antigens and minor transplant antigens. A suitable 
procedure is described in detail in Current Protocols in Immunology , above, unit 4.3. 
5 An animal model for skin allograft rejection is a means of testing the ability of T cells to mediate in vivo 

tissue destruction and a measure of their role in transplant rejection. The most common and accepted models use 
murine tail-skin grafts. Repeated experiments have shown that skin allograft rejection is mediated by T cells, 
helper T cells and killer-effector T cells, and not antibodies. Auchincloss, H. Jr. and Sachs, D. H., Fundamental 
Immunology . 2nd ed., W. E. Paul ed., Raven Press, NY, 889-992 (1989). A suitable procedure is described in 
10 detail in Current Prptocols in Immunology , above, unit 4.4. Other transplant rejection models which can be used 
to test the compounds of the invention are the allogeneic heart transplant models described by Tanabe, M. et al. , 
Transplantation. 58:23 (1994) and Tinubu, S. A. et al., J. Immunol. . 4330-4338 (1994). 

Animal models for delayed type hypersensitivity provides an assay of cell mediated immune function as 
well. Delayed type hypersensitivity reactions are a T cell mediated in vivo immune response characterized by 

15 inflammation which does not reach a peak until after a period of time has elapsed after challenge with an antigen. 
These reactions also occur in tissue specific autoimmune diseases such as multiple sclerosis (MS) and experimental 
autoimmune encephalomyelitis (EAE, a model for MS). A suitable procedure is described in detail in Current 
Protocols in Immunology , above, unit 4.5. 

EAE is a T cell mediated autoimmune disease characterized by T cell and mononuclear cell inflammation 

20 and subsequent demyelination of axons in the central nervous system. EAE is generally considered to be a 
relevant animal model for MS in humans. Bolton, C, Multiple Sclerosis. 1:143 (1995). Both acute and 
relapsing-remitting models have been developed. The compounds of the invention can be tested for T cell 
stimulatory or inhibitory activity against immune mediated demyelinating disease using the protocol described in 
Current Protocols in Immunology , above, units 15.1 and 15.2. See also the models for myelin disease in which 

25 oligodendrocytes or Schwann cells are grafted into the central nervous system as described in Duncan, I. D. et 
al., Molec. Med. Today. 554-561 (1997). 

Contact hypersensitivity is a simple delayed type hypersensitivity in vivo assay of cell mediated immune 
function. In this procedure, cutaneous exposure to exogenous haptens which gives rise to a delayed type 
hypersensitivity reaction which is measured and quantitated. Contact sensitivity involves an initial sensitizing 

30 phase followed by an elicitation phase . The elicitation phase occurs when the T lymphocytes encounter an antigen 
to which they have had previous contact. Swelling and inflammation occur, making this an excellent model of 
human allergic contact dermatitis. A suitable procedure is described in detail in Current Protocols in 
Immunology . Eds. J. E. Cologan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach and W. Strober, John 
Wiley & Sons, Inc., unit 4.2 (1994). I also Grabbe, S. and Schwarz, T, Immun. Today. 19 (1) : 37-44 (1998). 

35 An animal model for arthritis is collagen-induced arthritis. This model shares clinical, histological and 

immunological characteristics of human autoimmune rheumatoid arthritis and is an acceptable model for human 
autoimmune arthritis. Mouse and rat models are characterized by synovitis, erosion of cartilage and subchondral 
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bone. The compounds of the invention can be tested for activity against autoimmune arthritis using the protocols 
described in Current Protocols in Immunology , above, units 15.5. See also the model using a monoclonal 
antibody to CD18 and VLA-4 integrins described in Issekutz, A.C. et al., Immunology , 88:569 (1996). 

A model of asthma has been described in which antigen-induced airway hyper-reactivity, pulmonary 
eosinophilia and inflammation are induced by sensitizing an animal with ovalbumin and then challenging the 
animal with the same protein delivered by aerosol. Several animal models (guinea pig, rat, non-human primate) 
show symptoms similar to atopic asthma in humans upon challenge with aerosol antigens. Murine models have 
many of the features of human asthma. Suitable procedures to test the compounds of the invention for activity 
and effectiveness in the treatment of asthma are described by Wolyniec, W. W. et al. , Am. J. Respir. Cell Mol. 
Biol. , 18:777 (1998) and the references cited therein. 

Additionally, the compounds of the invention can be tested on animal models for psoriasis like diseases. 
Evidence suggests a T cell pathogenesis for psoriasis. The compounds of the invention can be tested in the 
scid/scid mouse model described by Schon, M. P. etal. , Nat. Med . , 3: 183 (1997), in which the mice demonstrate 
histopathologic skin lesions resembling psoriasis. Another suitable model is the human skin/scid mouse chimera 
prepared as described by Nickoloff, B. J. et al., Am. J. Path. . 146:580 (1995). 

Recombinant (transgenic) animal models can be engineered by introducing the coding portion of the genes 
identified herein into the genome of animals of interest, using standard techniques for producing transgenic 
animals. Animals that can serve as a target for transgenic manipulation include, without limitation, mice, rats, 
rabbits, guinea pigs, sheep, goats, pigs, and non-human primates, e.g., baboons, chimpanzees and monkeys. 
Techniques known in the art to introduce a transgene into such animals include pronucleic microinjection (Hoppe 
and Wanger, U.S. Patent No. 4,873, 191); retrovirus-mediated gene transfer into germ lines {e.g. , Van der Putten 
etal., Proc. Natl. Acad. Sci. USA . 82, 6148-615 [1985]); gene targeting in embryonic stem cells (Thompson 
et al., Cell. 56, 313-321 [1989]); electroporation of embryos (Lo, Mol. Cel. Biol.. 3, 1803-1814 [1983]); 
sperm-mediated gene transfer (Lavitrano et al., Cell, 57, 717-73 [1989]). For review, see, for example, U.S. 
Patent No. 4,736,866. 

For the purpose of the present invention, transgenic animals include those that carry the transgene only 
in part of their cells ("mosaic animals"). The transgene can be integrated either as a single transgene, or in 
concatamers, e.g., head-to-head or head-to-tail tandems. Selective introduction of a transgene into a particular 
cell type is also possible by following, for example, the technique of Lasko et al., Proc. Natl. Acad. Sci. USA. 
89, 6232-636 (1992). 

The expression of the transgene in transgenic animals can be monitored by standard techniques. For 
example, Southern blot analysis or PCR amplification can be used to verify the integration of the transgene. The 
level of mRNA expression can then be analyzed using techniques such as in situ hybridization, Northern blot 
analysis, PCR, or immunocytochemistry. 

The animals may be further examined for signs of immune disease pathology, for example by histological 
examination to determine infiltration of immune cells into specific tissues. Blocking experiments can also be 
performed in which the transgenic animals are treated with the compounds of the invention to determine the extent 
of the T cell proliferation stimulation or inhibition of the compounds. In these experiments, blocking antibodies 
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which bind to the PR0842 polypeptide, prepared as described above, are administered to the animal and the effect 
on immune function is determined. 

Alternatively, "knock out" animals can be constructed which have a defective or altered gene encoding 
a polypeptide identified herein, as a result of homologous recombination between the endogenous gene encoding 
the polypeptide and altered genomic DNA encoding the same polypeptide introduced into an embryonic cell of 
5 the animal. For example, cDNA encoding a particular polypeptide can be used to clone genomic DNA encoding 
that polypeptide in accordance with established techniques. A portion of the genomic DNA encoding a particular 
polypeptide can be deleted or replaced with another gene, such as a gene encoding a selectable marker which can 
be used to monitor integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' 
ends) are included in the vector [see e.g., Thomas and Capecchi, Cell . 51:503 (1987) for a description of 

10 homologous recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by 
electroporation) and cells in which the introduced DNA has homologously recombined with the endogenous DNA 
are selected [see e.g., Li etal., Cell . 69:915 (1992)]. The selected cells are then injected into a blastocyst of an 
animal (e.g., a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach . E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152]. A 

15 chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal and the embryo 
brought to term to create a "knock out" animal. Progeny harboring the homologously recombined DNA in their 
germ cells can be identified by standard techniques and used to breed animals in which all cells of the animal 
contain the homologously recombined DNA. Knockout animals can be characterized for instance, for their ability 
to defend against certain pathological conditions and for their development of pathological conditions due to 

20 absence of the polypeptide. 

J. Immuno Adjuvant Therapy 

In one embodiment, the immunostimulating compounds of the invention can be used in immunoadjuvant 
therapy for the treatment of tumors (cancer). It is now well established that T cells recognize human tumor 

25 specific antigens. One group of tumor antigens, encoded by the MAGE, BAGE and GAGE families of genes, 
are silent in all adult normal tissues , but are expressed in significant amounts in tumors, such as melanomas, lung 
tumors, head and neck tumors, and bladder carcinomas. DeSmet, C. etal. , Proc. Natl. Acad. Sci. USA . 93:7149 
(1996). It has been shown that costimulation of T cells induces tumor regression and an antitumor response both 
in vitro and in vivo. Melero, I. et al., Nature Medicine. 3 :682 (1997); Kwon, E. D. et al., Proc. Natl. Acad. 

30 Sci. USA. 94: 8099 (1997); Lynch, D. H. etal.. Nature Medicine. 3:625 (1997); Finn, O. J. and Lotze, M. T., 
J. Immunol. . 21 : 1 14 (1998). The stimulatory compounds of the invention can be administered as adjuvants, alone 
or together with a growth regulating agent, cytotoxic agent or chemotherapeutic agent, to stimulate T cell 
proliferation/activation and an antitumor response to tumor antigens. The growth regulating, cytotoxic, or 
chemotherapeutic agent may be administered in conventional amounts using known administration regimes. 

35 Immunostimulating activity by the compounds of the invention allows reduced amounts of the growth regulating, 
cytotoxic, or chemotherapeutic agents thereby potentially lowering the toxicity to the patient. 
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K. Screening Assays for Drug Candidates 

Screening assays for drug candidates are designed to identify compounds that bind to or complex with 
the polypeptides encoded by the genes identified herein or a biologically active fragment thereof, or otherwise 
interfere with the interaction of the encoded polypeptides with other cellular proteins. Such screening assays will 
include assays amenable to high-throughput screening of chemical libraries, making them particularly suitable for 
5 identifying small molecule drug candidates. Small molecules contemplated include synthetic organic or inorganic 
compounds, including peptides, preferably soluble peptides, (poly)peptide-irnmunoglobulin fusions, and, in 
particular, antibodies including, without limitation, poly- and monoclonal antibodies and antibody fragments, 
single-chain antibodies, anti-idiotypic antibodies, and chimeric or humanized versions of such antibodies or 
fragments, as well as human antibodies and antibody fragments. The assays can be performed in a variety of 

10 formats, including protein-protein binding assays, biochemical screening assays, immunoassays and cell based 
assays, which are well characterized in the art. All assays are common in that they call for contacting the drug 
candidate with a polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient 
to allow these two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 

15 reaction mixture. In a particular embodiment, the polypeptide encoded by the gene identified herein or the drug 
candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent attachments. 
Non-covalent attachment generally is accomplished by coating the solid surface with a solution of the polypeptide 
and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific for the polypeptide 
to be immobilized can be used to anchor it to a solid surface. The assay is performed by adding the 

20 non-immobilized component, which may be labeled by a detectable label, to the immobilized component, e.g., 
the coated surface containing the anchored component. When the reaction is complete, the non-reacted 
components are removed, e.g., by washing, and complexes anchored on the solid surface are detected. When 
the originally non-immobilized component carries a detectable label, the detection of label immobilized on the 
surface indicates that complexing occurred. Where the originally non-immobilized component does not carry a 

25 label, complexing can be detected, for example, by using a labelled antibody specifically binding the immobilized 
complex. 

If the candidate compound interacts with but does not bind to a particular protein encoded by a gene 
identified herein, its interaction with that protein can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, cross-linking, 

30 co-immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, 
protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and 
co-workers [Fields and Song, Nature (London). 340 . 245-246 (1989); Chien etaL, Proc. Natl. Acad. Sci. USA, 
88, 9578-9582 (1991)] as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA. 89, 5789-5793 (1991). 
Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one 

35 acting as the DNA-binding domain, while the other one functioning as the transcription activation domain. The 
yeast expression system described in the foregoing publications (generally referred to as the "two-hybrid system") 
takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to the 
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DNA-binding domain of GAL4, and another, in which candidate activating proteins are fused to the activation 
domain. The expression of a GALl-lacZ reporter gene under control of a GAL4-activated promoter depends on 
reconstitution of GAL4 activity via protein-protein interaction. Colonies containing interacting polypeptides are 
detected with a chromogenic substrate for P-galactosidase. A complete kit (MATCHMAKER™) for identifying 
protein-protein interactions between two specific proteins using the two-hybrid technique is commercially available 
5 from Clontech. This system can also be extended to map protein domains involved in specific protein interactions 
as well as to pinpoint amino acid residues that are crucial for these interactions. 

In order to find compounds that interfere with the interaction of a gene identified herein and other intra- 
or extracellular components can be tested, a reaction mixture is usually prepared containing the product of the 
gene and the intra- or extracellular component under conditions and for a time allowing for the interaction and 

10 binding of the two products. To test the ability of a test compound to inhibit binding, the reaction is run in the 
absence and in the presence of the test compound. In addition, a placebo may be added to a third reaction 
mixture, to serve as positive control. The binding (complex formation) between the test compound and the intra- 
or extracellular component present in the mixture is monitored as described above. The formation of a complex 
in the control reaction(s) but not in the reaction mixture containing die test compound indicates that the test 

15 compound interferes with the interaction of the test compound and its reaction partner. 



L. Compositions and Methods for the Treatment of Immune Related Diseases 
The compositions useful in the treatment of immune related diseases include, without limitation, proteins, 
antibodies, small organic molecules, peptides, phosphopeptides, antisense and ribozyme molecules, triple helix 
20 molecules, etc. that inhibit or stimulate immune function, for example, T cell proliferation/activation, lymphokine 
release, or immune cell infiltration. 

For example, antisense RNA and RNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. When antisense DNA is used, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g., between about -10 and + 10 positions 
25 of the target gene nucleotide sequence, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by endonucleolytic 
cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by known techniques. 
For further details see, e.g. , Rossi, Current Biology. 4, 469-471 (1994), and PCT publication No. WO 97/33551 
30 (published September 18, 1997). 

Nucleic acid molecules in triple helix formation used to inhibit transcription should be single-stranded 
and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it 
promotes triple helix formation via Hoogsteen base pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
35 97/33551, supra. 

These molecules can be identified by any or any combination of the screening assays discussed above 
and/or by any other screening techniques well known for those skilled in the art. 




M. Anti-PRQ842 Antibodies 

The present invention further provides anti-PR0842 antibodies. Exemplary antibodies include 
polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies. 



The anti-PR0842 antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by 
one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent 
and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The 
immunizing agent may include the PR0842 polypeptide or a fusion protein thereof. It may be useful to conjugate 
the immunizing agent to a protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine 
thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's 
complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 
The immunization protocol may be selected by one skilled in the art without undue experimentation. 



The anti-PR0842 antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may 
be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature , 256 :495 (1975). 
In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the PR0842 polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen 
cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused 
with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
[Goding, Monoclonal Antibodies: Principles and Practice . Academic Press, (1986) pp. 59-103]. Immortalized 
cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and human 
origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the growth or survival of the 
unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of 
HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk 
Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, 



1. 



Polyclonal Antibodies 



2. 



Monoclonal Antibodies 
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Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies [Kozbor, J. Immunol. , 133:3001 (1984); Brodeur et al. , Monoclonal 
Antibody Production Techniques and Applications , Marcel Dekker, Inc., New York, (1987) pp. 51-63], 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of 
monoclonal antibodies directed against PR0842. Preferably, the binding specificity of monoclonal antibodies 
5 produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are 
known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem. . 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 

10 procedures and grown by standard methods [Goding, supra]. Suitable culture media for this purpose include, for 
example, Dulbecco's Modified Eagle's Medium and RPMI- 1640 medium. Alternatively, the hybridoma cells may 
be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture medium 
or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein 

15 A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described 
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated 
and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding 
specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the 

20 invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 
or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal 
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting the coding 
sequence for human heavy and light chain constant domains in place of the homologous murine sequences [U.S. 

25 Patent No. 4,816,567; Morrison et al., supra] or by covalently joining to the immunoglobulin coding sequence 
all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide 
can be substituted for the constant domains of an antibody of the invention, or can be substituted for the variable 
domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well 

30 known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and 
modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent heavy 
chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid residue 
or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 

35 produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known in 
the art. 
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3. Human and Humanized Antibodies 

The anti-PR0842 antibodies of the invention may further comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
5 Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody nor 

10 in the imported CDR or framework sequences. In general, the humanized antibody will comprise substantially 
all of at least one, and typically two, variable domains, in which all or substantially all of the CDR regions 
correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are those of a 
human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a 
portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones et al., 

15 Nature . 321:522-525 (1986); Riechmann etaL, Nature . 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. . 
2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These 
non-human amino acid residues are often referred to as "import" residues, which are typically taken from an 

20 "import" variable domain. Humanization can be essentially performed following the method of Winter and 
co-workers [Jones etaL. Nature . 321:522-525 (1986); Riechmann etaL, Nature . 332:323-327 (1988); Verhoeyen 
et aL, Science . 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent 
No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the 

25 corresponding sequence from a non-human species. In practice, humanized antibodies are typically human 
antibodies in which some CDR residues and possibly some FR residues are substituted by residues from analogous 
sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage 
display libraries [Hoogenboom and Winter, J. Mol. Biol .. 227:381 (1991); Marks etaL, J. Mol. Biol. . 222:581 

30 (1991)]. The techniques of Cole et al., and Boerner et aL, are also available for the preparation of human 
monoclonal antibodies (Cole etaL, Monoclonal Antibodies and Cancer Therapy . Alan R. Liss, p. 77 (1985) and 
Boerner et al. , J. Immunol. . 147(l) :86-95 (1991)]. Similarly, human antibodies can be made by introducing of 
human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin genes 
have been partially or completely inactivated. Upon challenge, human antibody production is observed, which 

35 closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody 
repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 
5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: Marks et aL, Bio/Technology. 
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10, 779-783 (1992); Lonberg et al., Nature. 368: 856-859 (1994); Morrison, Nature. 368 : 812-13 (1994); 
Fishwild et al., Nature Biotechnology. 14: 845-51 (1996); Neuberger, Nature Biotechnology. 14: 826 (1996); 
Lonberg and Huszar, Intern. Rev. Immunol.. 13 : 65-93 (1995). 

The antibodies may also be affinity matured using known selection and/or mutagenesis methods as 
described above. Preferred affinity matured antibodies have an affinity which is five times, more preferably 10 
5 times, even more preferably 20 or 30 times greater than the starting antibody (generally murine, humanized or 
human) from which the matured antibody is prepared. 




4. Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
10 specificities for at least two different antigens. In the present case, one of the binding specificities is for the 
PR0842, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 
subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production 
of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain pairs, where 

15 the two heavy chains have different specificities [Milstein and Cuello, Nature . 305:537-539 (1983)]. Because of 
the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct bispecific structure. The 
purification of the correct molecule is usually accomplished by affinity chromatography steps. Similar procedures 
are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker et al., EMBO J. . 10:3655-3659 

20 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can 
be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin 
heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to 
have the first heavy -chain constant region (CHI) containing the site necessary for light-chain binding present in 

25 at least one of the fusions. DNAs encoding the immunoglobulin heavy -chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et al., Methods 
in Enzvmologv . 121:210 (1986). 

According to another approach described in WO 96/27011, the interface between a pair of antibody 

30 molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant 
cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain. 
In this method, one or more small amino acid side chains from the interface of the first antibody molecule are 
replaced with larger side chains {e.g., tyrosine or tryptophan). Compensatory "cavities" of identical or similar 
size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large 

35 amino acid side chains with smaller ones (e.g., alanine or threonine). This provides a mechanism for increasing 
the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g., F(ab')2 



bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical 
linkage . Brennan et al. , Science, 229 : 81(1 985) describe a procedure wherein intact antibodies are proteolytically 
cleaved to generate F(ab')2 fragments. These fragments are reduced in the presence of the dithiol complexing 
agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' 
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives 
is then reconverted to the Fab' -thiol by reduction with mercaptoethylamine and is mixed with an equimolar 
amount of the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies produced can 
be used as agents for the selective immobilization of enzymes. 

Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific 
antibodies. Shalaby et al., J. Exp. Med.. 175:217-225 (1992) describe the production of a fully humanized 
bispecific antibody F(ab')2 molecule. Each Fab' fragment was separately secreted from E. coli and subjected 
to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various technique for making and isolating bispecific antibody fragments directly from recombinant cell 
culture have also been described. For example, bispecific antibodies have been produced using leucine zippers. 
Kostelny et al., J. Immunol.. 148(5) : 1547-1553 (1992). The leucine zipper peptides from the Fos and Jun 
proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers 
were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. This 
method can also be utilized for the production of antibody homodimers. The "diabody" technology described by 
Hollinger et al., Proc. Natl. Acad. Sci. USA. 90:6444-6448 (1993) has provided an alternative mechanism for 
making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain (VH) connected 
to a light-chain variable domain (VL) by a linker which is too short to allow pairing between the two domains on 
the same chain. Accordingly, the VH and VL domains of one fragment are forced to pair with the complementary 
VL and VH domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, 
Gruber et al. , J. Immunol.. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be 
prepared. Tutt et al. , J. Immunol.. 147:60 (1991). 

Exemplary bispecific antibodies may bind to two different epitopes on a given PR0842 polypeptide 
herein. Alternatively, an anti-PR0842 polypeptide arm may be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g., CD2, CD3, CD28, or B7), or Fc receptors for 
IgG (FcgR), such as FcgRI (CD64), FcgRII (CD32) and FcgRIII (CD16) so as to focus cellular defense 
mechanisms to the cell expressing the particular PR0842 polypeptide. Bispecific antibodies may also be used to 
localize cytotoxic agents to cells which express a particular PR0842 polypeptide. These antibodies possess a 
PR0842-binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, 
DPTA, DOT A, or TETA. Another bispecific antibody of interest binds the PR0842 polypeptide and further 
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binds tissue factor (TF). 



5. Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate antibodies 
are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed to target 
5 immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV infection [WO 
91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro using 
known methods in synthetic protein chemistry, including those involving crosslinking agents. For example, 
immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond. Examples 
of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, 
10 for example, in U.S. Patent No. 4,676,980. 



6. Effector Function Engineering 

It may be desirable to modify the antibody of the invention with respect to effector function, so as to 
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be 

15 introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The 
homodimeric antibody thus generated may have improved internalization capability and/or increased 
complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al , J. Exp 
Med. . 176 : 1191-1195 (1992) and Shopes, J. Immunol. . 148 : 2918-2922 (1992). Homodimeric antibodies with 
enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as described in Wolff 

20 et al., Cancer Research . 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual Fc 
regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al, 
Anti-Cancer Drug Design . 3: 219-230 (1989). 



7. Immunoconjugates 

25 The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent 

such as a chemotherapeutic agent, toxin {e.g. , an enzymatically active toxin of bacterial, fungal, plant, or animal 
origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above. 
Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active 

30 fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 
modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, 
PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are available 
for the production of radioconjugated antibodies. Examples include 212 Bi, I3I I, 131 In, 90 Y, and l86 Re. 

35 Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling 

agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives 
of imidoesters (such as dimethyl adipimidate HC1), active esters (such as disuccinimidyl suberate), aldehydes 
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(such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium 
derivatives (such as bis-(p-diazoiuumbenzoyl)-ethylenediarnine), diisocyanates (such as tolyene 2,6-diisocyanate), 
and bis-active fluorine compounds (such as 1 ,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin 
can be prepared as described in Vitetta et al., Science , 23il098 (1987). Carbon- 14-labeled 
l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent 
for conjugation of radionucleotide to the antibody. See W094/ 11026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for utilization 
in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed by removal 
of unbound conjugate from the circulation using a clearing agent and then administration of a "ligand" (e.g., 
avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide). 

8. Immunoliposomes 

The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing 
the antibody are prepared by methods known in the art, such as described in Epstein et al. , Proc. Natl. Acad. Sci. 
USA . 82:3688 (1985); Hwang etal. Proc. Natl Acad. Sci. USA . 77:4030(1980); and U.S. Pat. Nos. 4,485,045 
and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphatidylethanolamine 
(PEG-PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired 
diameter. Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as described 
in Martin et al. , J. Biol. Chem. . 257 : 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic 
agent (such as Doxorubicin) is optionally contained within the liposome. See Gabizon et al. , J. National Cancer 
Inst. . 81(19) : 1484 (1989). 

9. Uses for anti-PRQ842 Antibodies 

The anti-PR0842 antibodies of the invention have various utilities. For example, anti-PR0842 antibodies 
may be used in diagnostic assays for PR0842, e.g., detecting its expression (and in some cases, differential 
expression) in specific cells, tissues, or serum. Various diagnostic assay techniques known in the art may be used, 
such as competitive binding assays, direct or indirect sandwich assays and immunoprecipitation assays conducted 
in either heterogeneous or homogeneous phases [Zola, Monoclonal Antibodies: A Manual of Techniques . CRC 
Press, Inc. (1987) pp. 147-158]. The antibodies used in the diagnostic assays can be labeled with a detectable 
moiety. The detectable moiety should be capable of producing, either directly or indirectly, a detectable signal. 
For example, the detectable moiety may be a radioisotope, such as 3 H, 14 C, 32 P, 35 S, or 125 I, a fluorescent or 
chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as 
alkaline phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for conjugating 
the antibody to the detectable moiety may be employed, including those methods described by Hunter et al., 
Nature . 144:945 (1962); David et al., Biochemistry . 13:1014 (1974); Pain et al., J. Immunol. Meth. . 40:219 
(1981); and Nygren, J. Histochem. and Cvtochem. . 30:407 (1982). 
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Anti-PR0842 antibodies also are useful for the affinity purification of PR0842 from recombinant cell 
culture or natural sources. In this process, the antibodies against PR0842 are immobilized on a suitable support, 
such a Sephadex resin or filter paper, using methods well known in the art. The immobilized antibody then is 
contacted with a sample containing the PR0842 to be purified, and thereafter the support is washed with a suitable 
solvent that will remove substantially all the material in the sample except the PR0842, which is bound to the 
5 immobilized antibody. Finally, the support is washed with another suitable solvent that will release the PR0842 
from the antibody. 

The following examples are offered for illustrative purposes only, and are not intended to limit the scope 
of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by reference 
10 in their entirety. 

N. Pharmaceutical Compositions 

The active PR0842 molecules of the invention (e.g., PR0842 polypeptides, anti-PR0842 antibodies, 
and/or variants of each) as well as other molecules identified by the screening assays disclosed above, can be 

15 administered for the treatment of immune related diseases, in the form of pharmaceutical compositions. 

Therapeutic formulations of the active PR0842 molecule, preferably a polypeptide or antibody of the invention, 
are prepared for storage by mixing the active molecule having the desired degree of purity with optional 
pharmaceutically acceptable carriers, excipients or stabilizers (Remington's Pharmaceutical Sciences 16th edition, 
Osol, A. Ed. [1980]), in the form of lyophilized formulations or aqueous solutions. Acceptable carriers, 

20 excipients, or stabilizers are nontoxic to recipients at the dosages and concentrations employed, and include 
buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid and methionine; 
preservatives (such as octadecyldimethylbenzyl ammonium chloride; hexamethonium chloride; benzalkonium 
chloride, benzethonium chloride; phenol, butyl or benzyl alcohol; alkyl parabens such as methyl or propyl 
paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m-cresol); low molecular weight (less than about 

25 10 residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers 
such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, histidine, arginine, or lysine; 
monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating 
agents such as EDTA; sugars such as sucrose, mannitol, trehalose or sorbitol; salt-forming counter-ions such as 
sodium; metal complexes (e.g., Zn-protein complexes); and/or non-ionic surfactants such as TWEEN™, 

30 PLURONICS™ or polyethylene glycol (PEG). 

Compounds identified by the screening assays disclosed herein can be formulated in an analogous 
manner, using standard techniques well known in the art. 

Lipofections or liposomes can also be used to deliver the PR0842 molecule into cells. Where antibody 
fragments are used, the smallest inhibitory fragment which specifically binds to the binding domain of the target 

35 protein is preferred. For example, based upon the variable region sequences of an antibody, peptide molecules 
can be designed which retain the ability to bind the target protein sequence. Such peptides can be synthesized 
chemically and/or produced by recombinant DNA technology (see, e.g. , Marasco et al. , Proc. Natl. Acad. Sci. 
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USA, 90:7889-7893 [1993]). 

The formulation herein may also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely affect each other. 
Alternatively, or in addition, the composition may comprise a cytotoxic agent, cytokine or growth inhibitory 
agent. Such molecules are suitably present in combination in amounts that are effective for the purpose intended. 
5 The active PR0842 molecules may also be entrapped in microcapsules prepared, for example, by 

coacervation techniques or by interfacial polymerization, for example, hydroxymethylcellulose or 
gelatin-microcapsules and poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery 
systems (for example, liposomes, albumin microspheres, microemulsions, nano-particles and nanocapsules) or 
in macroemulsions. Such techniques are disclosed in Remington's Pharmaceutical Sciences 16th edition, Osol, 
10 A. Ed. (1980). 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

Sustained-release preparations or the PR0842 molecules may be prepared. Suitable examples of 
sustained-release preparations include semipermeable matrices of solid hydrophobic polymers containing the 
15 antibody, which matrices are in the form of shaped articles, e.g., films, or microcapsules. Examples of 
sustained-release matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or 
poly(vinylalcohol)), polylactides (U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and 
Y-ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such 
as the LUPRON DEPOT™ (injectable microspheres composed of lactic acid-glycolic acid copolymer and 
20 leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and 
lactic acid-glycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins for 
shorter time periods. When encapsulated antibodies remain in the body for a long time, they may denature or 
aggregate as a result of exposure to moisture at 37°C, resulting in a loss of biological activity and possible 
changes in immunogenicity. Rational strategies can be devised for stabilization depending on the mechanism 
25 involved. For example, if the aggregation mechanism is discovered to be intermolecular S-S bond formation 
through thio-disulfide interchange, stabilization may be achieved by modifying sulfhydryl residues, lyophilizing 
from acidic solutions, controlling moisture content, using appropriate additives, and developing specific polymer 
matrix compositions. 

30 O. Methods of Treatment 

It is contemplated that the polypeptides, antibodies and other active compounds of the present invention 
may be used to treat various immune related diseases and conditions, such as T cell mediated diseases, including 
those characterized by infiltration of inflammatory cells into a tissue, stimulation of T-cell proliferation, inhibition 
of T-cell proliferation, increased or decreased vascular permeability or the inhibition thereof. 

35 Exemplary conditions or disorders to be treated with the polypeptides, antibodies and other compounds 

of the invention, include, but are not limited to systemic lupus erythematosis, rheumatoid arthritis, juvenile 
chronic arthritis, osteoarthritis, spondyloarthropathies, systemic sclerosis (scleroderma), idiopathic inflammatory 



83 



myopathies (dermatomyositis, polymyositis), Sjogren's syndrome, systemic vasculitis, sarcoidosis, autoimmune 
hemolytic anemia (immune pancytopenia, paroxysmal nocturnal hemoglobinuria), autoimmune thrombocytopenia 
(idiopathic thrombocytopenic purpura, immune-mediated thrombocytopenia), thyroiditis (Grave's disease, 
Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, atrophic thyroiditis), diabetes mellitus, 
immune-mediated renal disease (glomerulonephritis, tubulointerstitial nephritis), demyelinating diseases of the 
5 central and peripheral nervous systems such as multiple sclerosis, idiopathic demyelinating polyneuropathy or 
Guillain-Barre syndrome, and chronic inflammatory demyelinating polyneuropathy, hepatobiliary diseases such 
as infectious hepatitis (hepatitis A, B, C, D, E and other non-hepatotropic viruses), autoimmune chronic active 
hepatitis, primary biliary cirrhosis, granulomatous hepatitis, and sclerosing cholangitis, inflammatory bowel 
disease (ulcerative colitis: Crohn's disease), gluten-sensitive enteropathy, and Whipple's disease, autoimmune or 
10 immune-mediated skin diseases including bullous skin diseases, erythema multiforme and contact dermatitis, 
psoriasis, allergic diseases such as asthma, allergic rhinitis, atopic dermatitis, food hypersensitivity and urticaria, 
immunologic diseases of the ovaries, immunologic diseases of the lung such as eosinophilic pneumonia, idiopathic 
pulmonary fibrosis and hypersensitivity pneumonitis, transplantation associated diseases including graft rejection 
and graft -versus-host-disease. 

15 In systemic lupus erythematosus, the central mediator of disease is the production of auto-reactive 

antibodies to self proteins/tissues and the subsequent generation of immune-mediated inflammation. Antibodies 
either directly or indirectly mediate tissue injury. Though T lymphocytes have not been shown to be directly 
involved in tissue damage, T lymphocytes are required for the development of auto-reactive antibodies. The 
genesis of the disease is thus T lymphocyte dependent. Multiple organs and systems are affected clinically 

20 including kidney, lung, musculoskeletal system, mucocutaneous, eye, central nervous system, cardiovascular 
system, gastrointestinal tract, bone marrow and blood. 

Rheumatoid arthritis (RA) is a chronic systemic autoimmune inflammatory disease that mainly involves 
the synovial membrane of multiple joints with resultant injury to. the articular cartilage. The pathogenesis is T 
lymphocyte dependent and is associated with the production of rheumatoid factors, auto-antibodies directed against 

25 self IgG, with the resultant formation of immune complexes that attain high levels in joint fluid and blood. These 
complexes in the joint may induce the marked infiltrate of lymphocytes and monocytes into the synovium and 
subsequent marked synovial changes; the joint space/fluid if infiltrated by similar cells with the addition of 
numerous neutrophils. Tissues affected are primarily the joints, often in symmetrical pattern. However, 
extra-articular disease also occurs in two major forms. One form is the development of extra-articular lesions 

30 with ongoing progressive joint disease and typical lesions of pulmonary fibrosis, vasculitis, and cutaneous ulcers. 
The second form of extra-articular disease is the so called Felty's syndrome which occurs late in the RA disease 
course, sometimes after joint disease has become quiescent, and involves the presence of neutropenia, 
thrombocytopenia and splenomegaly. This can be accompanied by vasculitis in multiple organs with formations 
of infarcts, skin ulcers and gangrene. Patients often also develop rheumatoid nodules in the subcutis tissue 

35 overlying affected joints; the nodules late stage have necrotic centers surrounded by a mixed inflammatory cell 
infiltrate. Other manifestations which can occur in RA include: pericarditis, pleuritis, coronary arteritis, 
interstitial pneumonitis with pulmonary fibrosis, keratoconjunctivitis sicca, and rheumatoid nodules. 
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Juvenile chronic arthritis is a chronic idiopathic inflammatory disease which begins often at less than 16 
years of age. Its phenotype has some similarities to RA; some patients which are rheumatoid factor positive are 
classified as juvenile rheumatoid arthritis. The disease is sub-classified into three major categories: pauciarticular, 
polyarticular, and systemic. The arthritis can be severe and is typically destructive and leads to joint ankylosis 
and retarded growth. Other manifestations can include chronic anterior uveitis and systemic amyloidosis. 
5 Spondyloarthropathies are a group of disorders with some common clinical features and the common 

association with the expression of HLA-B27 gene product. The disorders include: ankylosing spondylitis, Reiter ' s 
syndrome (reactive arthritis), arthritis associated with inflammatory bowel disease, spondylitis associated with 
psoriasis, juvenile onset spondyloarthropathy and undifferentiated spondyloarthropathy. Distinguishing features 
include sacroileitis with or without spondylitis; inflammatory asymmetric arthritis; association with HLA-B27 (a 

10 serologically defined allele of the HLA-B locus of class I MHC); ocular inflammation, and absence of 
autoantibodies associated with other rheumatoid disease. The cell most implicated as key to induction of the 
disease is the CD8 + T lymphocyte, a cell which targets antigen presented by class I MHC molecules. CD8 + T 
cells may react against the class I MHC allele HLA-B27 as if it were a foreign peptide expressed by MHC class 
I molecules. It has been hypothesized that an epitope of HLA-B27 may mimic a bacterial or other microbial 

15 antigenic epitope and thus induce a CD8 + T cells response. 

Systemic sclerosis (scleroderma) has an unknown etiology. A hallmark of the disease is induration of 
the skin; likely this is induced by an active inflammatory process. Scleroderma can be localized or systemic; 
vascular lesions are common and endothelial cell injury in the microvasculature is an early and important event 
in the development of systemic sclerosis; the vascular injury may be immune mediated. An immunologic basis 

20 is implied by the presence of mononuclear cell infiltrates in the cutaneous lesions and the presence of anti-nuclear 
antibodies in many patients. ICAM-1 is often upregulated on the cell surface of fibroblasts in skin lesions 
suggesting that T cell interaction with these cells may have a role in the pathogenesis of the disease. Other organs 
involved include: the gastrointestinal tract: smooth muscle atrophy and fibrosis resulting in abnormal 
peristalsis/motility; kidney: concentric subendothelial intimal proliferation affecting small arcuate and interlobular 

25 arteries with resultant reduced renal cortical blood flow, results in proteinuria, azotemia and hypertension; skeletal 
muscle: atrophy, interstitial fibrosis; inflammation; lung: interstitial pneumonitis and interstitial fibrosis; and 
heart: contraction band necrosis, scarring/fibrosis. 

Idiopathic inflammatory myopathies including dermatomyositis, polymyositis and others are disorders 
of chronic muscle inflammation of unknown etiology resulting in muscle weakness. Muscle injury /inflammation 

30 is often symmetric and progressive. Autoantibodies are associated with most forms. These myositis-specific 
autoantibodies are directed against and inhibit the function of components, proteins and RNA's, involved in 
protein synthesis. 

Sjogren's syndrome is due to immune-mediated inflammation and subsequent functional destruction of 
the tear glands and salivary glands. The disease can be associated with or accompanied by inflammatory 
35 connective tissue diseases. The disease is associated with autoantibody production against Ro and La antigens, 
both of which are small RNA-protein complexes. Lesions result in keratoconjunctivitis sicca, xerostomia, with 
other manifestations or associations including biliary cirrhosis, peripheral or sensory neuropathy, and palpable 
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purpura. 

Systemic vasculitis are diseases in which the primary lesion is inflammation and subsequent damage to 
blood vessels which results in ischemia/necrosis/degeneration to tissues supplied by the affected vessels and 
eventual end-organ dysfunction in some cases. Vasculitides can also occur as a secondary lesion or sequelae to 
other immune-inflammatory mediated diseases such as rheumatoid arthritis, systemic sclerosis, etc., particularly 
5 in diseases also associated with the formation of immune complexes. Diseases in the primary systemic vasculitis 
group include: systemic necrotizing vasculitis: polyarteritis nodosa, allergic angiitis and granulomatosis, 
polyangiitis; Wegener's granulomatosis; lymphomatoid granulomatosis; and giant cell arteritis. Miscellaneous 
vasculitides include: mucocutaneous lymph node syndrome (MLNS or Kawasaki's disease), isolated CNS 
vasculitis, Beliefs disease, thromboangiitis obliterans (Buerger's disease) and cutaneous necrotizing venulitis. 
10 The pathogenic mechanism of most of the types of vasculitis listed is believed to be primarily due to the deposition 
of immunoglobulin complexes in the vessel wall and subsequent induction of an inflammatory response either via 
ADCC, complement activation, or both. 

Sarcoidosis is a condition of unknown etiology which is characterized by the presence of epithelioid 
granulomas in nearly any tissue in the body; involvement of the lung is most common. The pathogenesis involves 
15 the persistence of activated macrophages and lymphoid cells at sites of the disease with subsequent chronic 
sequelae resultant from the release of locally and systemically active products released by these cell types. 

Autoimmune hemolytic anemia including autoimmune hemolytic anemia, immune pancytopenia, and 
paroxysmal noctural hemoglobinuria is a result of production of antibodies that react with antigens expressed on 
the surface of red blood cells (and in some cases other blood cells including platelets as well) and is a reflection 
20 of the removal of those antibody coated cells via complement mediated lysis and/or ADCC/Fc-receptor-mediated 
mechanisms. 

In autoimmune thrombocytopenia including thrombocytopenic purpura, and immune-mediated 
thrombocytopenia in other clinical settings, platelet destruction/removal occurs as a result of either antibody or 
complement attaching to platelets and subsequent removal by complement lysis, ADCC or FC-receptor mediated 
25 mechanisms. 

Thyroiditis including Grave's disease, Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, and 
atrophic thyroiditis, are the result of an autoimmune response against thyroid antigens with production of 
antibodies that react with proteins present in and often specific for the thyroid gland. Experimental models exist 
including spontaneous models: rats (BUF and BB rats) and chickens (obese chicken strain); inducible models: 
30 immunization of animals with either thyroglobulin, thyroid microsomal antigen (thyroid peroxidase). 

Type I diabetes mellitus or insulin-dependent diabetes is the autoimmune destruction of pancreatic islet 
cells; this destruction is mediated by auto-antibodies and auto-reactive T cells. Antibodies to insulin or the insulin 
receptor can also produce the phenotype of insulin-non-responsiveness. 

Immune mediated renal diseases, including glomerulonephritis and tubulointerstitial nephritis, are the 
35 result of antibody or T lymphocyte mediated injury to renal tissue either directly as a result of the production of 
autoreactive antibodies or T cells against renal antigens or indirectly as a result of the deposition of antibodies 
and/or immune complexes in the kidney that are reactive against other, non-renal antigens. Thus other 



immune-mediated diseases that result in the formation of immune-complexes can also induce immune mediated 
renal disease as an indirect sequelae. Both direct and indirect immune mechanisms result in inflammatory 
response that produces/induces lesion development in renal tissues with resultant organ function impairment and 
in some cases progression to renal failure. Both humoral and cellular immune mechanisms can be involved in 
the pathogenesis of lesions. 

5 Demyelinating diseases of the central and peripheral nervous systems, including multiple sclerosis; 

idiopathic demyelinating polyneuropathy or Guillain-Barre syndrome; and chronic inflammatory demyelinating 

polyneuropathy, are believed to have an autoimmune basis and result in nerve demyelination as a result of damage 

caused to oligodendrocytes or to myelin directly. In MS there is evidence to suggest that disease induction and 

progression is dependent on T lymphocytes. Multiple sclerosis is a demyelinating disease that is T 
10 lymphocyte-dependent and has either a relapsing-remitting course or a chronic progressive course. The etiology 

is unknown; however, viral infections, genetic predisposition, environment, and autoimmunity all contribute. 

Lesions contain infiltrates of predominantly T lymphocyte mediated, microglial cells and infiltrating macrophages; 

CD4 + T lymphocytes are the predominant cell type at lesions. The mechanism of oligodendrocyte cell death and 

subsequent demyelination is not known but is likely T lymphocyte driven. 
1 5 Inflammatory and fibrotic lung disease, including eosinophilic pneumonia; idiopathic pulmonary fibrosis, 

and hypersensitivity pneumonitis may involve a disregulated immune-inflammatory response. Inhibition of that 

response would be of therapeutic benefit. 

Autoimmune or immune -mediated skin disease including bullous skin diseases, erythema multiforme, 

and contact dermatitis are mediated by auto-antibodies, the genesis of which is T lymphocyte-dependent. 
20 Psoriasis is a T lymphocyte-mediated inflammatory disease. Lesions contain infiltrates of T lymphocytes, 

macrophages and antigen processing cells, and some neutrophils. 

Allergic diseases, including asthma; allergic rhinitis; atopic dermatitis; food hypersensitivity; and 

urticaria are T lymphocyte dependent. These diseases are predominantly mediated by T lymphocyte induced 

inflammation, IgE mediated-inflammation or a combination of both. 
25 Transplantation associated diseases, including graft rejection and graft- versus -host-disease (GVHD) are 

T lymphocyte-dependent; inhibition of T lymphocyte function is ameliorative. 

Other diseases in which intervention of the immune and/or inflammatory response have benefit are 

infectious disease including but not limited to viral infection (including but not limited to AIDS, hepatitis A, B, 

C, D, E and herpes) bacterial infection, fungal infections, and protozoal and parasitic infections (molecules (or 
30 derivatives/agonists) which stimulate the MLR can be utilized therapeutically to enhance the immune response 

to infectious agents), diseases of immunodeficiency (molecules/derivatives/agonists) which stimulate the MLR 

can be utilized therapeutically to enhance the immune response for conditions of inherited, acquired, infectious 

induced (as in HIV infection), or iatrogenic (i.e., as from chemotherapy) immunodeficiency, and neoplasia. 

It has been demonstrated that some human cancer patients develop an antibody and/or T lymphocyte 
35 response to antigens on neoplastic cells. It has also been shown in animal models of neoplasia that enhancement 

of the immune response can result in rejection or regression of that particular neoplasm. Molecules that enhance 

the T lymphocyte response in the MLR have utility in vivo in enhancing the immune response against neoplasia. 
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Molecules which enhance the T lymphocyte proliferative response in the MLR (or small molecule agonists or 
antibodies that affected the same receptor in an agonistic fashion) can be used therapeutically to treat cancer. 
Molecules that inhibit the lymphocyte response in the MLR also function in vivo during neoplasia to suppress the 
immune response to a neoplasm; such molecules can either be expressed by the neoplastic cells themselves or their 
expression can be induced by the neoplasm in other cells. Antagonism of such inhibitory molecules (either with 
5 antibody, small molecule antagonists or other means) enhances immune-mediated tumor rejection. 

Additionally, inhibition of molecules with proinflammatory properties may have therapeutic benefit in 
reperfusion injury; stroke; myocardial infarction; atherosclerosis; acute lung injury; hemorrhagic shock; burn; 
sepsis/septic shock; acute tubular necrosis; endometriosis; degenerative joint disease and pancreatitis. 
The compounds of the present invention, e.g., polypeptides or antibodies, are administered to a mammal, 
10 preferably a human, in accord with known methods, such as intravenous administration as a bolus or by 
continuous infusion over a period of time, by intramuscular, intraperitoneal, intracerebral spinal, subcutaneous, 
intra-articular, intra synovial, intrathecal, oral, topical, or inhalation (intranasal, intrapulmonary) routes. 
Intravenous or inhaled administration of polypeptides and antibodies is preferred. 

In immunoadjuvant therapy, other therapeutic regimens, such administration of an anti-cancer agent, may 

15 be combined with the administration of the proteins, antibodies or compounds of the instant invention. For 
example, the patient to be treated with a the immunoadjuvant of the invention may also receive an anti-cancer 
agent (chemotherapeutic agent) or radiation therapy. Preparation and dosing schedules for such chemotherapeutic 
agents may be used according to manufacturers' instructions or as determined empirically by the skilled 
practitioner. Preparation and dosing schedules for such chemotherapy are also described in Chemotherapy 

20 Service, Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). The chemotherapeutic agent may 
precede, or follow administration of the immunoadjuvant or may be given simultaneously therewith. Additionally, 
an anti-oestrogen compound such as tamoxifen or an anti-progesterone such as onapristone (see, EP 616812) may 
be given in dosages known for such molecules. 

It may be desirable to also administer antibodies against other immune disease associated or tumor 

25 associated antigens, such as antibodies which bind to CD20, CDlla, CD18, ErbB2, EGFR, ErbB3, ErbB4, or 
vascular endothelial factor (VEGF). Alternatively, or in addition, two or more antibodies binding the same or 
two or more different antigens disclosed herein may be coadministered to the patient. Sometimes, it may be 
beneficial to also administer one or more cytokines to the patient. In one embodiment, the PR0842 polypeptides 
are coadministered with a growth inhibitory agent. For example, the growth inhibitory agent may be administered 

30 first, followed by a PR0842 polypeptide. However, simultaneous administration or administration first is also 
contemplated. Suitable dosages for the growth inhibitory agent are those presently used and may be lowered due 
to the combined action (synergy) of the growth inhibitory agent and the PR0842 polypeptide. 
For the treatment or reduction in the severity of immune related disease, the appropriate dosage of an a compound 
of the invention will depend on the type of disease to be treated, as defined above, the severity and course of the 

35 disease, whether the agent is administered for preventive or therapeutic purposes, previous therapy, the patient's 
clinical history and response to the compound, and the discretion of the attending physician. The compound is 
suitably administered to the patient at one time or over a series of treatments. 
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For example, depending on the type and severity of the disease, about 1 mg/kg to 15 mg/kg {e.g. , 0. 1-20 
mg/kg) of polypeptide or antibody is an initial candidate dosage for administration to the patient, whether, for 
example, by one or more separate administrations, or by continuous infusion. A typical daily dosage might range 
from about 1 mg/kg to 100 mg/kg or more, depending on the factors mentioned above. For repeated 
administrations over several days or longer, depending on the condition, the treatment is sustained until a desired 
5 suppression of disease symptoms occurs. However, other dosage regimens may be useful. The progress of this 
therapy is easily monitored by conventional techniques and assays. 



P. Articles of Manufacture 

In another embodiment of the invention, an article of manufacture containing materials {e.g. , comprising 
10 a PR0842 molecule) useful for the diagnosis or treatment of the disorders described above is provided. The 
article of manufacture comprises a container and an instruction. Suitable containers include, for example, bottles, 
vials, syringes, and test tubes. The containers may be formed from a variety of materials such as glass or plastic. 
The container holds a composition which is effective for diagnosing or treating the condition and may have a 
sterile access port (for example the container may be an intravenous solution bag or a vial having a stopper 
15 pierceable by a hypodermic injection needle). The active agent in the composition is usually a polypeptide or an 
antibody of the invention. An instruction or label on, or associated with, the container indicates that the 
composition is used for diagnosing or treating the condition of choice. The article of manufacture may further 
comprise a second container comprising a pharmaceutically-acceptable buffer, such as phosphate-buffered saline, 
Ringer's solution and dextrose solution. It may further include other materials desirable from a commercial and 
20 user standpoint, including other buffers, diluents, filters, needles, syringes, and package inserts with instructions 
for use. 



Q. Diagnosis and Prognosis of Immune Related Disease 

Cell surface proteins, such as proteins which are overexpressed in certain immune related diseases, are 
25 excellent targets for drug candidates or disease treatment. The same proteins along with secreted proteins encoded 
by the genes amplified in immune related disease states find additional use in the diagnosis and prognosis of these 
diseases. For example, antibodies directed against the protein products of genes amplified in multiple sclerosis, 
rheumatoid arthritis, inflammatory bowel disorder, or another immune related disease, can be used as diagnostics 
or prognostics. 

30 For example, antibodies, including antibody fragments, can be used to qualitatively or quantitatively 

detect the expression of proteins encoded by amplified or overexpressed genes ("marker gene products"). The 
antibody preferably is equipped with a detectable, e.g., fluorescent label, and binding can be monitored by light 
microscopy, flow cytometry, fluorimetry, or other techniques known in the art. These techniques are particularly 
suitable, if the overexpressed gene encodes a cell surface protein Such binding assays are performed essentially 

35 as described above. 

In situ detection of antibody binding to the marker gene products can be performed, for example, by 
immunofluorescence or immunoelectron microscopy. For this purpose, a histological specimen is removed from 
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the patient, and a labeled antibody is applied to it, preferably by overlaying the antibody on a biological sample. 
This procedure also allows for determining the distribution of the marker gene product in the tissue examined. 
It will be apparent for those skilled in the art that a wide variety of histological methods are readily available for 
in situ detection. 

The following examples are offered for illustrative purposes only, and are not intended to limit the scope 
5 of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by reference 
in their entirety. 

EXAMPLES 

10 Commercially available reagents referred to in the examples were used according to manufacturer's 

instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, 
VA. 

15 EXAMPLE 1 

Isolation of cDNA Clones Encoding Human PRQ842 
PR0842 polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal 
sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as 
clustered and assembled EST fragments from public (e.g., GenBank) and/or private (LIFESEQ®, Incyte 

20 Pharmaceuticals, Inc., Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal 
score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine 
codon(s) (ATG) at the 5' -end of the sequence or sequence fragment under consideration. The nucleotides 
following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first 
ATG has the required amino acids, the second is not examined. If neither meets the requirement, the candidate 

25 sequence is not scored. In order to determine whether the EST sequence contains an authentic signal sequence, 
the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven 
sensors (evaluation parameters) known to be associated with secretion signals. Use of this algorithm resulted in 
identification of a single Incyte EST cluster sequence designated herein as Incyte EST cluster sequence no. 69572. 
This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which 

30 included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the 
computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins 
were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University 

35 of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated 
DNA54230. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
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encompassed within the Merck EST clone no. AA477092, the Merck EST clone AA477092 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 98 and is herein designated as DNA56855-1447. 

The full length clone shown in Figure 98 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 153-155 and ending at the stop codon found at nucleotide 
5 positions 510-512 (Figure 98; SEQ ID NO: 164). The predicted polypeptide precursor (Figure 99, SEQ ID 
NO: 165) is 119 amino acids long. PR0842 has a calculated molecular weight of approximately 13,819 Daltons 
and an estimated pi of approximately 1 1. 16. Other features of PR0842 include a signal peptide at about amino 
acids 1-22, a potential protein kinase C phosphorylation site at about amino acids 39-41 and two potential N- 
myristoylation sites at about amino acids 27-32 and about amino acids 46-51. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 98 (SEQ ID NO: 164), evidenced some homology 
between the PR0842 amino acid sequence and the following Dayhoff sequences: CEZK13111, PR80843, 
RAT5HT2X1, S81882 1, A60912, MCU60315J37MC137L, U93422 1, pj>91996, U93462 1, and 
ZN18_HUMAN. 

15 

EXAMPLE 2 
Expression of PRQ842 in E. coli 
This example illustrates preparation of an un glycosylated form of PR0842 polypeptides by recombinant 
expression in E. coli. 

20 The DNA sequence encoding a PR0842 polypeptide is initially amplified using selected PCR primers. 

The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites on the 
selected expression vector. A variety of expression vectors may be employed. An example of a suitable vector 
is pBR322 (derived from E. coli; see Bolivar et al., Gene , 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR amplified 

25 sequences are then ligated into the vector. The vector will preferably include sequences which encode for an 
antibiotic resistance gene, a trp promoter, a polyHis leader (including the first six STII codons, polyHis sequence, 
and enterokinase cleavage site), the PR0842 polypeptide coding region, lambda transcriptional terminator, and 
an argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described in 
30 Sambrook et al. , supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant 
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are then 
35 grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized 
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PR0842 protein can then be purified using a metal chelating column under conditions that allow tight binding of 
the protein. 

PR0842 polypeptides may be expressed in E. coli in a poly-His tagged form, using the following 
procedure. The DNA encoding a PR0842 polypeptide is initially amplified using selected PCR primers. The 
primers will contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
5 expression vector, and other useful sequences providing for efficient and reliable translation initiation, rapid 
purification on a metal chelation column, and proteolytic removal with enterokinase. The PCR-amplified, poly- 
His tagged sequences are then ligated into an expression vector, which is used to transform an E. coli host based 
on strain 52 (W3110 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB 
containing 50 mg/ml carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then 
10 diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium citrate«2H20, 1 .07 
g KC1, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MPOS, pH 
7.3, 0.55% (w/v) glucose and 7 mM MgS0 4 ) and grown for approximately 20-30 hours at 30°C with shaking. 
Samples are removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet 
the cells. Cell pellets are frozen until purification and refolding. 
15 E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 

guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0.1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results 
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 
► rpm in a Beckman Ultracentrifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 

20 column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 
clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the desired 
protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm using the 
25 calculated extinction coefficient based on its amino acid sequence. 

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. Refolding 
volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The refolding 
solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition of TFA to 
30 a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the solution 
is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The refolded 
protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0. 1 % TFA with 
elution with a gradient of acetonitrile from 10 to 80% . Aliquots of fractions with A280 absorbance are analyzed 
on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. Generally, the 
35 properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile since those species 
are the most compact with their hydrophobic interiors shielded from interaction with the reversed phase resin. 
Aggregated species are usually eluted at higher acetonitrile concentrations. In addition to resolving misfolded 
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forms of proteins from the desired form, the reversed phase step also removes endotoxin from the samples. 

Fractions containing the desired folded PR0842 polypeptide are pooled and the acetonitrile removed 
using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 
5 PR0842 polypeptides were successfully expressed as described above. 

EXAMPLE 3 
Expression of PRQ842 in mammalian cells 
This example illustrates preparation of a potentially glycosylated form of PR0842 polypeptides by 
10 recombinant expression in mammalian cells. 

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 
Optionally, the PR0842 DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the 
PR0842 DNA using ligation methods such as described in Sambrook et al. , supra. The resulting vector is called 
pRK5-PR0842. 

15 In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 

grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 fig pRK5-PR0842 DNA is mixed with about 1 fig 
DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)] and dissolved in 500 fi\ of 1 mM 
Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 500 fil of 50 mM HEPES (pH 

20 7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 
precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The culture 
medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are then washed 
with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture 

25 medium (alone) or culture medium containing 200 /iCi/ml 35 S-cysteine and 200 /xCi/ml 35 S-methionine. After a 
12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15% 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the presence 
of the PR0842 polypeptide. The cultures containing transfected cells may undergo further incubation (in serum 
free medium) and the medium is tested in selected bioassays. 

30 In an alternative technique, PR0842 may be introduced into 293 cells transiently using the dextran sulfate 

method described by Somparyrac et al. , Proc. Natl. Acad. Sci. . 12:7575 (1981). 293 cells are grown to maximal 
density in a spinner flask and 700 fig pRK5-PR0842 DNA is added. The cells are first concentrated from the 
spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell pellet 
for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture medium, and 

35 re-introduced into the spinner flask containing tissue culture medium, 5 /ig/ml bovine insulin and 0. 1 fig/ml bovine 
transferrin. After about four days, the conditioned media is centrifiiged and filtered to remove cells and debris. 
The sample containing the expressed PR0842 polypeptide can then be concentrated and purified by any selected 
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method, such as dialysis and/or column chromatography. 

In another embodiment, PR0842 polypeptides can be expressed in CHO cells. The pRK5-PR0842 can 
be transfected into CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the 
cell cultures can be incubated, and the medium replaced with culture medium (alone) or medium containing a 
radiolabel such as 35 S-methionine. After determining the presence of the PR0842 polypeptide, the culture 
medium may be replaced with serum free medium. Preferably, the cultures are incubated for about 6 days, and 
then the conditioned medium is harvested. The medium containing the expressed PR0842 polypeptide can then 
be concentrated and purified by any selected method. 

Epitope-tagged PR0842 may also be expressed in host CHO cells. The PR0842 may be subcloned out 
of the pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as 
a poly-His tag into a Baculovirus expression vector. The poly-His tagged PR0842 insert can then be subcloned 
into a SV40 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, 
the CHO cells can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, 
as described above, to verify expression. The culture medium containing the expressed poly-His tagged PR0842 
can then be concentrated and purified by any selected method, such as by Ni 2+ -chelate affinity chromatography. 

PR0842 polypeptides may also be expressed in CHO and/or COS cells by a transient expression 
procedure or in CHO cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms {e.g., extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and 
CH2 domains, and/or as a poly-His tagged form. 

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 
standard techniques as described in Ausubel et al. y Current Protocols of Molecular Biology . Unit 3.16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' 
of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used in expression in CHO cells 
is as described in Lucas et al. 9 Nucl. Acids Res. . 24:9 (1774-1779 (1996), and uses the SV40 early 
promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells 
using commercially available transfection reagents Superfect* (Qiagen), Dosper* or Fugene* (Boehringer 
Mannheim). The cells are grown as described in Lucas et al., supra. Approximately 3 x 10 7 cells are frozen in 
an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
//m filtered PS20 with 5% 0.2 /u,m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 
spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner filled 
with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL and 2000 
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mL spinners are seeded with 3 x 10 5 cells/mL. The cell media is exchanged with fresh media by centrifugation 
and resuspension in production medium. Although any suitable CHO media may be employed, a production 
medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 3L production 
spinner is seeded at 1.2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 1, the spinner is 
sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the temperature shifted 
to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g. , 35 % polydimethylsiloxane emulsion, 
Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, the pH is adjusted as necessary 
to keep it at around 7.2. After 10 days, or until the viability dropped below 70%, the cell culture is harvested 
by centrifugation and filtering through a 0.22 ^m filter. The filtrate was either stored at 4°C or immediately 
loaded onto columns for, purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 
purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media is 
pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl and 
5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4% 
mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 mM 
Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer before 
elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 ml 
fractions into tubes containing 275 >uL of 1 M Tris buffer, pH 9. The highly purified protein is subsequently 
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed by 
SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation. 

PR0842 polypeptides were successfully expressed as described above. 

EXAMPLE 4 
Expression of PRQ842 in Yeast 

The following method describes recombinant expression of PR0842 polypeptides in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PR0842 from 
the ADH2/GAPDH promoter. DNA encoding the PR0842 polypeptide and the promoter is inserted into suitable 
restriction enzyme sites in the selected plasmid to direct intracellular expression of the PR0842 polypeptide. For 
secretion, DNA encoding PR0842 can be cloned into the selected plasmid, together with DNA encoding the 
ADH2/GAPDH promoter, a native PR0842 signal peptide or other mammalian signal peptide, or, for example, 
a yeast alpha-factor or invertase secretory signal/leader sequence, and linker sequences (if needed) for expression 
ofPR0842. 

Yeast cells, such as yeast strain AB1 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by 
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precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 

Recombinant PR0842 polypeptides can subsequently be isolated and purified by removing the yeast cells 
from the fermentation medium by centrifugation and then concentrating the medium using selected cartridge 
filters. The concentrate containing the PR0842 polypeptide may further be purified using selected column 
5 chromatography resins. 

EXAMPLE 5 

Expression of PRQ842 in Baculovirus-Infected Insect Cells 
The following method describes recombinant expression of PR0842 polypeptides in Baculovirus-infected 
10 insect cells. 

The sequence coding for PR0842 is fused upstream of an epitope tag contained within a Baculovirus 
expression vector. Such epitope tags include poly-His tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 
as pVL1393 (Novagen). Briefly, the sequence encoding PR0842 or the desired portion of the coding sequence 

15 of PR0842 such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus DNA 

20 (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin (commercially 
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used 
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et al. 9 
Baculovirus expression vectors: A Laboratory Manual , Oxford: Oxford University Press (1994). 

Expressed poly-His tagged PR0842 can then be purified, for example, by Ni 2+ -chelate affinity 

25 chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
Rupert et al., Nature , 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 
in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 jum filter. 

30 A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 mL, 
washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is loaded 
onto the column at 0.5 mL per minute. The column is washed to baseline A 280 with loading buffer, at which point 
fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM phosphate; 300 
mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching A 280 baseline 

35 again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash buffer. One mL 
fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with Ni 2+ -NT A -conjugated 
to alkaline phosphatase (Qiagen). Fractions containing the eluted His 10 -tagged PR0842 are pooled and dialyzed 



against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) PR0842 can be performed using known 
chromatography techniques, including for instance, Protein A or Protein G column chromatography. 
PR0842 polypeptides were successfully expressed as described above. 

5 EXAMPLE 6 

Preparation of Antibodies that Bind PRQ842 
This example illustrates preparation of monoclonal antibodies which can specifically bind PR0842. 
Techniques for producing the monoclonal antibodies are known in the art and are described, for instance, 
in Goding, supra. Immunogens that may be employed include purified PR0842 polypeptides, fusion proteins 
10 containing PR0842 polypeptides, and cells expressing recombinant PR0842 polypeptides on the cell surface. 
Selection of the immunogen can be made by the skilled artisan without undue experimentation. 

Mice, such as BALB/c, are immunized with the PR0842 immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
15 injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice by 
retro-orbital bleeding for testing in ELISA assays to detect anti-PR0842 antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
20 with a final intravenous injection of PR0842. Three to four days later, the mice are sacrificed and the spleen cells 
are harvested. The spleen cells are then fused (using 35 % polyethylene glycol) to a selected murine myeloma cell 
line such as P3X63AgU. 1, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells which 
can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and thymidine) 
medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 
25 The hybridoma cells will be screened in an ELISA for reactivity against PR0842. Determination of 

"positive" hybridoma cells secreting the desired monoclonal antibodies against PR0842 is within the skill in the 
art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic BALB/c mice to produce 
ascites containing the anti-PR0842 monoclonal antibodies. Alternatively, the hybridoma cells can be grown in 
30 tissue culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
affinity chromatography based upon binding of antibody to Protein A or Protein G can be employed. 

EXAMPLE 7 

35 Purification of PRQ842 Polypeptides Using Specific Antibodies 

Native or recombinant PR0842 polypeptides may be purified by a variety of standard techniques in the 
art of protein purification. For example, pro-PR0842 polypeptide, mature PR0842 polypeptide, or pre-PR0842 
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polypeptide is purified by immunoaffinity chromatography using antibodies specific for the PR0842 polypeptide 
of interest. In general, an immunoaffinity column is constructed by covalently coupling the anti-PR0842 
polypeptide antibody to an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 
sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). Likewise, 
monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 
chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 
chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immunoaffinity column is utilized in the purification of PR0842 polypeptide by preparing a 
fraction from cells containing PR0842 polypeptide in a soluble form. This preparation is derived by solubilization 
of the whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent 
or by other methods well known in the art. Alternatively, soluble PR0842 polypeptide containing a signal 
sequence may be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PR0842 polypeptide-containing preparation is passed over the immunoaffmity column, and 
the column is washed under conditions that allow the preferential absorbance of PR0842 polypeptide (e.g. , high 
ionic strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
antibody/PR0842 polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or a high 
concentration of a chaotrope such as urea or thiocyanate ion), and PR0842 polypeptide is collected. 

EXAMPLE 8 
Drug Screening 

This invention is particularly useful for screening compounds by using PR0842 polypeptides or binding 
fragment thereof in any of a variety of drug screening techniques. The PR0842 polypeptide or fragment 
employed in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or 
located intracellularly . One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PR0842 polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 
used for standard binding assays. One may measure, for example, the formation of complexes between PR0842 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PR0842 polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can affect 
a PR0842 polypeptide-associated disease or disorder. These methods comprise contacting such an agent with an 
PR0842 polypeptide or fragment thereof and assaying (i) for the presence of a complex between the agent and 
the PR0842 polypeptide or fragment, or (ii) for the presence of a complex between the PR0842 polypeptide or 
fragment and the cell, by methods well known in the art. In such competitive binding assays, the PR0842 
polypeptide or fragment is typically labeled. After suitable incubation, free PR0842 polypeptide or fragment is 
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separated from that present in bound form, and the amount of free or uncomplexed label is a measure of the 
ability of the particular agent to bind to PR0842 polypeptide or to interfere with the PR0842 polypeptide/cell 
complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PR0842 polypeptide, the peptide test compounds are reacted 
with PR0842 polypeptide and washed. Bound PR0842 polypeptide is detected by methods well known in the art. 
Purified PR0842 polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PR0842 polypeptide specifically compete with a test compound for binding to 
PR0842 polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of 
any peptide which shares one or more antigenic determinants with PR0842 polypeptide. 

EXAMPLE 9 
Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest (i.e. , a PR0842 polypeptide) or of small molecules with which they interact, e.g. , agonists, antagonists, 
or inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 
PR0842 polypeptide or which enhance or interfere with the function of the PR0842 polypeptide in vivo (c.f., 
Hodgson, Bio/Technology . 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PR0842 polypeptide, or of an PR0842 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PR0842 polypeptide must be 
ascertained to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information 
regarding the structure of the PR0842 polypeptide may be gained by modeling based on the structure of 
homologous proteins. In both cases, relevant structural information is used to design analogous PR0842 
polypeptide-like molecules or to identify efficient inhibitors. Useful examples of rational drug design may include 
molecules which have improved activity or stability as shown by Braxton and Wells, Biochemistry, 31:7796-7801 
(1992) or which act as inhibitors, agonists, or antagonists of native peptides as shown by Athauda et aL, L 
Biochem. . 113:742-746 (1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic 
antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, the 
binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then be 
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used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PR0842 polypeptide may be made available 
to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PR0842 polypeptide 
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques in 
place of or in addition to x-ray crystallography. 

EXAMPLE 10 

Microarrav Analysis to Detect Qverexpression of PRQ842 Polypeptides in Cancerous Tumors 

Nucleic acid microarrays, often containing thousands of gene sequences, are useful for identifying 
differentially expressed genes in diseased tissues as compared to their normal counterparts. Using nucleic acid 
microarrays, test and control mRNA samples from test and control tissue samples are reverse transcribed and 
labeled to generate cDNA probes. The cDNA probes are then hybridized to an array of nucleic acids immobilized 
on a solid support. The array is configured such that the sequence and position of each member of the array is 
known. For example, a selection of genes known to be expressed in certain disease states may be arrayed on a 
solid support. Hybridization of a labeled probe with a particular array member indicates that the sample from 
which the probe was derived expresses that gene. If the hybridization signal of a probe from a test (disease tissue) 
sample is greater than hybridization signal of a probe from a control (normal tissue) sample, the gene or genes 
overexpressed in the disease tissue are identified. The implication of this result is that an overexpressed protein 
in a diseased tissue is useful not only as a diagnostic marker for the presence of the disease condition, but also 
as a therapeutic target for treatment of the disease condition. 

The methodology of hybridization of nucleic acids and microarray technology is well known in the art. 
In the present example, the specific preparation of nucleic acids for hybridization and probes, slides, and 
hybridization conditions are all detailed in U.S. Provisional Patent Application Serial No. 60/193,767, filed on 
March 31, 2000 and which is herein incorporated by reference. 

In the present example, cancerous tumors derived from various human tissues were studied for PR0842 
polypeptide-encoding gene expression relative to non-cancerous human tissue in an attempt to identify those 
PR0842 polypeptides which are overexpressed in cancerous tumors. Two sets of experimental data were 
generated. In one set, cancerous human colon tumor tissue and matched non-cancerous human colon tumor tissue 
from the same patient ("matched colon control") were obtained and analyzed for PR0842 polypeptide expression 
using the above described microarray technology. In the second set of data, cancerous human tumor tissue from 
any of a variety of different human tumors was obtained and compared to a "universal" epithelial control sample 
which was prepared by pooling non-cancerous human tissues of epithelial origin, including liver, kidney, and 
lung. mRNA isolated from the pooled tissues represents a mixture of expressed gene products from these 
different tissues. Microarray hybridization experiments using the pooled control samples generated a linear plot 
in a 2-color analysis. The slope of the line generated in a 2-color analysis was then used to normalize the ratios 
of (testxontrol detection) within each experiment. The normalized ratios from various experiments were then 
compared and used to identify clustering of gene expression. Thus, the pooled "universal control" sample not only 
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allowed effective relative gene expression determinations in a simple 2-sample comparison, it also allowed multi- 
sample comparisons across several experiments. 

In the present experiments, nucleic acid probes derived from the herein described PR0842 polypeptide- 
encoding nucleic acid sequences were used in the creation of the microarray and RNA from the tumor tissues 
listed above were used for the hybridization thereto. A value based upon the normalized ratio: experimental ratio 
was designated as a "cutoff ratio". Only values that were above this cutoff ratio were determined to be 
significant. Table 7 below shows the results of these experiments, demonstrating that various PR0842 



polypeptides of the present invention are significantly overexpressed i^variousjiuman tumor tissues as compared 
to a non-cancerous human tissue control. As described above, these data demonstrate that the PR0842 
polypeptides of the present invention are useful not only as diagnostic markers for the presence of one or more 
10 cancerous tumors, but also serve as therapeutic tar gets for the treatment of those tumors. 



Table 7 
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Molecule 
PR0842 
PR0842 
PR0842 



is overexpressed in: 
colon rumor 
lung tumor 
breast tumor 



as compared to: 
universal normal control 
universal normal control 
universal normal control 



n 



EXAMPLE 11 
In situ Hybridization 

20 In situ hybridization is a powerful and versatile technique for the detection and localization of nucleic 

acid sequences within cell or tissue preparations. It may be useful, for example, to identify sites of gene 
expression, analyze the tissue distribution of transcription, identify and localize viral infection, follow changes 
in specific mRNA synthesis and aid in chromosome mapping. 

In situ hybridization was performed following an optimized version of the protocol by Lu and Gillett, 
25 Cell Vision . 1:169-176 (1994), using PCR-generated 33 P-labeled riboprobes. Briefly, formalin-fixed, paraffin- 
embedded human tissues were sectioned, deparaffinized, deproteinated in proteinase K (20 g/ml) for 15 minutes 
at 37°C, and further processed for in situ hybridization as described by Lu and Gillett, supra. A [ 33 -P] UTP- 
labeled antisense riboprobe was generated from a PCR product and hybridized at 55 °C overnight. The slides 
were dipped in Kodak NTB2 nuclear track emulsion and exposed for 4 weeks. 
30 3 3 P-Riboprobe synthesis 

6.0 fil (125 mCi) of 33 P-UTP (Amersham BF 1002, SA < 2000 Ci/mmol) were speed vac dried. To each 
tube containing dried 33 P-UTP, the following ingredients were added: 

2.0 fi\ 5x transcription buffer 

1.0 fil DTT (100 mM) 

35 2.0 id NTP mix (2.5 mM : 10 id; each of 10 mM GTP, CTP & ATP 4- 10 fd H 2 0) 

1.0 /il UTP (50 fiM) 
1.0 id Rnasin 
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1.0 /*1 DNA template (l^g) 
1.0 fil H 2 0 

1.0 fil RNA polymerase (for PCR products T3 = AS, T7 = S, usually) 

The tubes were incubated at 37°C for one hour. 1 .0 ^1 RQ1 DNase were added, followed by incubation 
at 37°C for 15 minutes. 90 /xl TE (10 mM Tris pH 7.6/lmM EDTA pH 8.0) were added, and the mixture was 
5 pipetted onto DE81 paper. The remaining solution was loaded in a Microcon-50 ultrafiltration unit, and spun 
using program 10 (6 minutes). The filtration unit was inverted over a second tube and spun using program 2 (3 
minutes). After the final recovery spin, 100 fil TE were added. 1 fil of the final product was pipetted on DE81 
paper and counted in 6 ml of Biofluor II. 

The probe was run on a TBE/urea gel. 1-3 {x\ of the probe or 5 jx\ of RNA Mrk III were added to 3 fil 
10 of loading buffer. After heating on a 95° C heat block for three minutes, the gel was immediately placed on ice. 
The wells of gel were flushed, the sample loaded, and run at 180-250 volts for 45 minutes. The gel was wrapped 
in saran wrap and exposed to XAR film with an intensifying screen in -70°C freezer one hour to overnight. 
3 3 P -Hybridization 

A. Pretreatment of frozen sections 

15 The slides were removed from the freezer, placed on aluminium trays and thawed at room temperature 

for 5 minutes. The trays were placed in 55°C incubator for five minutes to reduce condensation. The slides were 
fixed for 10 minutes in 4% paraformaldehyde on ice in the fume hood, and washed in 0.5 x SSC for 5 minutes, 
at room temperature (25 ml 20 x SSC + 975 ml SQ H 2 0). After deproteination in 0.5 ^g/ml proteinase K for 
10 minutes at 37°C (12.5 fil of 10 mg/ml stock in 250 ml prewarmed RNase-free RNAse buffer), the sections 

20 were washed in 0.5 x SSC for 10 minutes at room temperature. The sections were dehydrated in 70%, 95%, 
100% ethanol, 2 minutes each. 

B. Pretreatment of paraffin-embedded sections 

The slides were deparaffinized, placed in SQ H 2 0, and rinsed twice in 2 x SSC at room temperature, 
for 5 minutes each time. The sections were deproteinated in 20 ftg/ml proteinase K (500 fil of 10 mg/ml in 250 
25 ml RNase-free RNase buffer; 37°C, 15 minutes) - human embryo, or 8 x proteinase K (100 fil in 250 ml Rnase 
buffer, 37°C, 30 minutes) - formalin tissues. Subsequent rinsing in 0.5 x SSC and dehydration were performed 
as described above. 

C. Prehvbridization 

The slides were laid out in a plastic box lined with Box buffer (4 x SSC, 50% formamide) - saturated 
30 filter paper. The tissue was covered with 50 ix\ of hybridization buffer (3.75g Dextran Sulfate + 6 ml SQ H 2 0), 
vortexed and heated in the microwave for 2 minutes with the cap loosened. After cooling on ice, 18.75 ml 
formamide, 3.75 ml 20 x SSC and 9 ml SQ H 2 0 were added, the tissue was vortexed well, and incubated at 42°C 
for 1-4 hours. 

D. Hybridization 

35 1.0 x 10 6 cpm probe and 1.0 fil tRNA (50 mg/ml stock) per slide were heated at 95°C for 3 minutes. 

The slides were cooled on ice, and 48 fx\ hybridization buffer were added per slide. After vortexing, 50 fil 33 P 
mix were added to 50 /xl prehybridization on slide. The slides were incubated overnight at 55°C. 
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E. Washes 

Washing was done 2 x 10 minutes with 2xSSC, EDTA at room temperature (400 ml 20 x SSC + 16 ml 
0.25M EDTA, V f =4L), followed by RNaseA treatment at 37°C for 30 minutes (500 fi\ of 10 mg/ml in 250 ml 
Rnase buffer = 20 /xg/ml), The slides were washed 2 x 10 minutes with 2 x SSC, EDTA at room temperature. 
The stringency wash conditions were as follows: 2 hours at 55°C, 0. 1 x SSC, EDTA (20 ml 20 x SSC + 16 ml 
5 EDTA, V f =4L). 

F. Oligonucleotides 

In situ analysis was performed on DNA59294-1381 disclosed herein. The oligonucleotides employed 
for this analysis were derived from the nucleotide sequences disclosed herein and generally range from about 40 
to 55 nucleotides in length. 
10 G. Results 

In situ analysis was performed on DNA59294-1381 as disclosed herein. The results from this analysis 
is as follows. 

Expression of DNA56855 in tissue screen: adult inflamed lung and ovary 

Sections show an intense signal associated with bronchial epithelium in normal and inflamed adult lungs , 
15 including asthma and interstitial pneumonitis blocks. The lung tumor multiblock has intense signal over epithelial 
cells lining glandular structures that comprise the neoplasm. The fetal lung showed negative results. In a section 
of ovary, tubal epithelium adjacent to the ovarian stroma is positive. 

EXAMPLE 12 

20 Threading Analysis of PRQ842 (CK27) Suggests Structural Homology to Interleukin-8 

Threading methods have demonstrated high sensitivity on prediction of structure for novel sequences. 
These techniques are able to detect if a protein sequence resembles known structures without relying on sequence 
comparison, thus being able to identify relationships amoung proteins even if the sequence similarity between them 
is low. The present study demonstrated that threading techniques can detect lower sequence similarity than 

25 standard sequence similarity methods in particular for a novel chemokine PR0842 (designated CK27) encoded 
by DNA56855-1447. 

The fold recognition program ProCeryon was used for this analysis (ProCeryon Biosciences, Inc.). The 
fold library used consisted of 4,749 known 3D structures filtered at 95% sequence identity from the Protein Data 
Bank. Two mean force potentials describing the energetic forces between the residues of a fold and the energetic 

30 forces between the residues of the fold and the surrounding solvent were used to calculate the sequence-structure 
fitness (Domingues, et al., Proteins . 37:112-120 (1999); Sippl, M. J., J. Comput Aided Mol. Pes . 7:473-501 
(1993); and Sippl, M. J., Curr Qpin Struct Biol . 5:229-235 (1995)). Gap restrictions were used to control the 
number, size and placement of deletions and insertions in the query sequence and the fold library entries: fold and 
sequence deletion penalty, 300; fold and sequence penalty, 15; and fold and sequence maximum gap length, 30. 

35 Fold information was used as gap restraints: gaps were not allowed to be placed in secondary structure elements 
and in the core of the fold. A gap between adjacent residues in sequence was only accepted if the distance 
bewteen them in the structure is less than 7 A. To suppress the fragmentation on the sequence-structure 
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alignment, a minimum fragment size of 3 residues between two adjacent gaps was used. The BLOSUM40 amino 
acid substitution matrix was used for sequence comparison and scoring (Henrikoff , S. and Henrikoff , J . G. , Proc. 
Natl. Acad. Sci USA . 89:10915-10919 (1995)). 

A combination of four different scoring strategies was used for final scoring and ranking: i) a combined 
energy score derived from residue-residue and residue-solvent interactions (pair/surf); ii) a sequence similarity 
score (seq); iii) a normalized combination of both, pair/surf ands seq (Threading index; Th lax.); and iv) the ratio 
between the length of the fold (foldlength) and the number of aligned residues that took part in the sequence- 
structure alignment (pathlength). 

The 3D models generated for the high scoring hits and their respective sequence-structure alignments 
were analyzed graphically by using the Peep module in ProCeryon (ProCeryon Biosciences, Inc.). Structural 
classification schemes SCOP (Murzin et al., J. Mol. Biol. . 247:536-540 (1995)) and CATH (Orengo et al., 
Structure . 5: 1093-1108 (1997)) were used to find all members of a fold and look for the scoring of those in the 
ProFit output. Similar structures of a fold were checked in the top-ranking hits in order to confirm a fold 
annotation and to generate structure-based protein function hypothesis. 

Using the above described threading analysis, PR0842 (CK27) encoded by DNA56855-1447 suggested 
structural homology to the chemokine interleukin-8 (IL-8). Amino acid sequence identity between the two 
proteins is low, however they both have a CXC motif, which classifies IL-8 as a member of the CXC chemokine 
family. 



The expression pattern of CK27 was investigated in both human and mouse multiple tissues. Figures 
3 and 4 show the expression pattern of CK27 as demonstrated by hybridization to a human multiple tissu e^ 
expression array. CK27 was highly expressed in lung, stomach, and the trachea. In ovary, prostate, colon and 

f—, ... . — — - ., C_ „ ^, 

fetal Jung, weak expression of CK27 was detected. In situ analysis confirmed high expression of CK2 7 in normaj 
Jung and also showed expression in tissue samples from patients with inflammed l ungs and asthm a. The 
expression pattern of murine DNA56855 (CK27) is shown in Figure 5. In the mouse, significant levels of 
expression was observed in me Jung ^yroid,. submaxillary gland and the uterus. 

B. PR0842Chemoattracts Monocytes and Dendritic Cells 



Threading analysis of PR0842 (designated herein as CK27) suggested a functional role for this novel 
chemokine similar to IL-8. Interleukin-8 has long been characterized as a proinflammatory cytokine and 
chemoattractant for neutrophils. Accordingly, trans well migration assays were performed to demonstrate the role 
of PR0842 (CK27) as a potential chemokine. Peripheral blood mononuclear cells (PBMC) were isolated from 
human peripheral blood by Hypaque-Ficoll density centrifugation. Migration of PMBC towards PR0842 (CK27) 



EXAMPLE 13 



Novel Cytokine PRQ842 (CK27) Plays a Role in the Inflammatory Response 



A. 



Expression Patterns ofDNA56855 (CK27) 



(1) Methods 
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was tested in 24-well Trans well migration plates containing filters with 5 pores. 10 6 PBMC per well were 
used. After a 2.5 hour incubation at 37 °C and 5% C0 2 , the number of cells and cell populations migrated were 
analyzed by flow cytometry. 

It is interesting to note that threading analysis of PR0842 (CK27) demonstrated the possession of a CXC 
motif, suggesting that CK27 belongs to the family of CXC chemokines (see EXAMPLE 13). Structural homology 
5 was also found to DNA39523, which has been published as a novel chemokine (named MIP2-y) that also attracts 
dendritic cells (Cao et al. , Journal of Immunology . 165:2588-2595 (2000)). In addition to the presently identified 
chemokine PR0842 (CK27), MIP2-y and another related chemokine known as SDF-oc are the only CXC 
chemokines that attract dendritic cells. 

(2) Results 

10 As described above, threading analysis of CK27 prompted investigation of the chemoattractant properties 

of PR0842 (CK27). Results of trans well migration assays are shown in Figures 6 through 8. These studies 
demonstrate that PR0842 (CK27)^s pecificalj y chemoattracts monocytes and dendritic cells (see Figure 6). 
Different lots of CK27 show similar chemoattractant activity (Figure 7). Heat treatment reduces the 
chemoattraction of CDllc + (dendritic cells) and CD14* (monocytes) (Figure 8). These data show that CK27 

15 specifically attracts monocytes and dendritic cells. However, similar attraction of T-cells, B-cells or neutrophils 
was not observed. Both cell types (monocytes and dendritic cells) play an important role in the initiation of an 
immune response. Monocytes and dendritic cells have the ability of phagocytosing antigens, presenting antigens 
on their cell surface and activating T-cells by direct cell-cell interactions and secretion of cytokines. The present 
studies demonstrate that the novel chemokine PR0842 (CK27) identified herein plays an important role in 

20 leukocyte trafficking and may also function in the regulation of inflammatory responses. In addition, CK27 may 
play a role similar to interleukin-8 in a variety of homostatic and disease processes, including development, 
hematopoiesis, allergies, angiogenesis, and oncogenesis. As discussed below, studies involving human 
DNA56855 transgenic mice suggest that CK27 plays an important role in the development of ovarian cystic 
pathology. 
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9 

C Human DNA56855 (CK27) Transgenic Mouse Model Develop Cystic Ovaries f 



Human 56855 cDNA was ligated 3' to the pRK splice donor/acceptor site that was preceded by the 
myosin light chain using the protocol described by Shani, M., Nature . 314:283-286 (1985). The 56855 cDNA 
was also followed by the splice donor/acceptor sites present between the fourth and fifth exons of the human 
30 growth hormone gene (Stewart, T. A. et al. , Endocrinology . 130:405-14 (1992)). The entire expression fragment 
was purified free from contaminating vector sequences and injected into one cell mouse eggs derived from FVB 
x FVB matings. CK27 transgenic mice were identified by PCR analysis of DNA extracted from tail biopsies. 
Expression was determined by real-time RT-PCR (TaqMan*; Perkin Elmer) on total RNA from skeletal muscle 
biopsies. 

35 Mice expressing human DNA56855 (CK27) as a transgene under the control of the MLCH promoter 

developed cystic ovaries by 12 weeks of age (see Figure 9). Although in situ analysis showed relatively specific 
expression in lung bronchial epithelium (see EXAMPLE 12), CK27 transgenic mice had histologically normal 
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lungs. In addition, expression in tubule epithelium adjacent to the ovary (human tissue) has been observed in situ, 
which may be relevant to the changes seen in CK27 transgenic mice. 

Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
5 University Blvd., Manassas, VA 20110-2209, USA (ATCC): 

Material ATCC Dep. No. Deposit Date 

DNA56855-1447 203004 June 23, 1998 

These deposits were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
10 Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 
between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of the 
culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the public 
of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny to one 
15 determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 USC § 
122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular reference to 886 OG 
638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should die 
or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 

20 notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 

25 deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs that 
are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 

30 to those shown and described herein will become apparent to those skilled in the art from the foregoing description 
and fall within the scope of the appended claims. 
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